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Fuzzy Overlapping Community Quality Metrics 
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Abstract Modularity is widely used to effectively measure the strength of the 
disjoint community structure found by community detection algorithms. Sev¬ 
eral overlapping extensions of modularity were proposed to measure the quality 
of overlapping community structure. However, all these extensions differ just 
in the way they define the belonging coefficient and belonging function. Yet, 
there is lack of systematic comparison of different extensions. To fill this gap, 
we overview overlapping extensions of modularity and generalize them with a 
uniform definition enabling application of different belonging coefficients and 
belonging functions to select the best. In addition, we extend localized modu¬ 
larity, modularity density, and eight local community quality metrics to enable 
their usages for overlapping communities. The experimental results on a large 
number of real networks and synthetic networks using overlapping extensions 
of modularity, overlapping modularity density, and local metrics show that the 
best results are obtained when the product of the belonging coefficients of two 
nodes is used as the belonging function. Moreover, the results may be used 
to guide researchers on which metrics to adopt when measuring the quality of 
overlapping community structure. 


1 Introduction 

Many networks, including Internet, citation networks, transportation networks, 
email networks, and social and biochemical networks, display community struc- 
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ture which identifies groups of nodes within which connections are denser than 
between them m- Detecting and characterizing such community structure, 
which is known as community detection, is one of the fundamental issues in 
network science. Community detection has been shown to reveal latent yet 
meaningful structure in networks such as groups in online and contact-based 
social networks, functional modules in protein-protein interaction networks, 
groups of customers with similar interests in online retailer user networks, 
groups of scientists in interdisciplinary collaboration networks, etc. m 

In the last decade, the most popular community detection method, pro¬ 
posed by Newman m, has been to maximize the quality metric known as 
modularity |331I32] over all possible partitions of a network. This metric mea¬ 
sures the difference between the fraction of all edges that are within the actual 
community and such a fraction of edges that would be inside the community 
in a randomized graph with the same number of nodes and the same degree 
sequence. It is widely used to measure the strength of community structures 
discovered by community detection algorithms. 

Newman’s modularity can only be used to measure the quality of dis¬ 
joint communities. However, it is more realistic to expect that nodes in real 
networks belong to more than one community, resulting in overlapping com¬ 
munities [39]. Therefore, several overlapping extensions of modularity 1 |431 
[^I571[551 IH [M1[TS] I were proposed to measure the quality of overlapping com¬ 
munity structure. Yet, to date no attempt has been made to systematically 
compare different overlapping extensions and propose metric selection criteria 
for different types of networks. In this paper, we consider several overlapping 
extensions of modularity and test their quality on real and synthetic networks. 
We also extend localized modularity [55], modularity density [5117] . and eight 
local community quality metrics for overlapping communities following the 
same principles used by the overlapping extensions of modularity. 

We conducted experiments on a large number of real-world networks and 
synthetic networks using overlapping extensions of modularity, overlapping 
modularity density, and eight local metrics (the number of Intra-edges, Intra¬ 
density, Contraction, the number of Boundary-edges, Expansion, Conductance 
012], the Fitness function [24], and the Average Modularity Degree [l^). The 
results show that selecting the product of the belonging coefficients of two 
nodes as a belonging function for overlapping extensions yields better results 
on these networks than using other belonging functions. The experimental 
results also give a guidance to researchers on which metrics to choose when 
measuring the quality of overlapping community structure. 

Methodology; Below we introduce in steps the methodology we use to 
evaluate overlapping extensions of modularity: 

(1) We first give a generalized definition for existing overlapping extensions 
of modularity which covers four such extensions, each using one of the 
two different versions of belonging coefficient and one of the two different 
versions of belonging function. 

(2) Next, we extend localized modularity, modularity density, and eight lo¬ 
cal community quality metrics to be applicable to overlapping community 
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structures following the same principle as the overlapping extensions of 
modularity do. 

(3) Then, for the generalized definitions of these metrics, we first determine 
which version of the belonging coefficient and which version of the belong¬ 
ing function perform best for each metric. 

(4) Moreover, we determine which version of the belonging coefficient and 
which version of the belonging function scores the largest number of qual¬ 
ity metrics consistent with each other on determining the best values of 
parameters of compared community detection algorithms: the threshold r 
for SLPA [?D], the parameter k for CFinder [35], and the threshold tr for 
SpeakEasy m on the real and synthetic networks. 

(5) Finally, we compare the performance of the overlapping metrics with the 
best combination of belonging coefficient and belonging function by looking 
at how many times each metric is among those that are consistent with each 
other on determining the best values of parameters for the same algorithms 
as used in step (4). 

This work is an extension of our previous paper [^ which considered four 
real networks and ten community quality metrics for comparison. Also, it only 
adopted one overlapping community detection algorithm, SLPA m, as eval¬ 
uation method. However, in this paper, we consider totally 23 real network 
datasets, including friendship network, collaboration networks, co-purchasing 
networks, biology networks, etc. Table|T]shows the basic properties of all these 
datasets. These networks have different numbers of nodes and different num¬ 
bers of edges, varying from very small to very large networks. Edges in some 
networks have weights and directions. Besides real networks, we also consider 
LFR benchmark networks [23] , each of which is instantiated with a wide range 
of parameters in the experiments. Moreover, we consider two more community 
quality metrics, the Fitness function [24] and Average Modularity Degree [25] . 
totally twelve metrics for comparison. In addition, we adopt three overlapping 
community detection algorithms, SLPA [30], CFinder [35], and SpeakEasy m, 
as testing methods. 

2 Modularity 

2.1 Newman’s Modularity 

Newman’s modularity mm for unweighted and undirected networks is de¬ 
fined as the difference between the fractions of the actual and expected (in a 
randomized graph with the same number of nodes and the same degree se¬ 
quence) number of edges within the community. A larger value of modularity 
means a stronger community structure. For a given community partition of a 
network G = (E, E) with \E\ edges, modularity (Q) [33] is given by: 



( 1 ) 
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where C is the set of all the communities, c is a specific community in C, \E™\ 
is the number of edges between nodes within community c, and is the 

number of edges from the nodes in community c to the nodes outside c. 
Modularity can also be expressed as [32]: 


Q 


1 

W\ 



kikj 


2\E\\ 




( 2 ) 


where ki is the degree of node i, Aij is an element of the adjacency matrix 
between node i and node j, S^^cj is the Kronecker delta symbol, and Ci is the 
label of the community to which node i is assigned. 


2.2 Overlapping Definition of Modularity 


Newman’s modularity is used to measure the quality of disjoint community 
structure of a network. However, it is more realistic that nodes in networks be¬ 
long to more than one community, resulting in overlapping communities [39] . 
For instance, a researcher may be active in several research areas, and a node 
in biological networks might have multiple functions. It is also quite common 
that people in social networks are naturally characterized by multiple commu¬ 
nity memberships depending on their families, friends, professional colleagues, 
neighbors, etc. For this reason, discovering overlapping communities became 
very popular in the last few years. Several overlapping extensions of modular¬ 
ity P51[^[571I551 H1 [MI[TB] were proposed to measure the quality of overlapping 
community structure. These extensions are described below. 

If communities overlap, each node can belong to multiple communities, 
although the strength of this connection can generally be different for different 
communities. Given a set of overlapping communities C = {ci, C 2 ,..., c,..., c\c\} 
in which a node may belong to more than one of them, a vector of belonging 
coefficients (oi^ci, ai,c 2 j aqcid) [^151] can be assigned to each node 

i in the network. \C\ is the number of communities. The belonging coefficient 
Oi^c measures the strength of association between node i and community c. 
Without loss of generality, the following constraints are assumed to hold: 


0 < < 1 Vi G y VcG C 

and 

ai,c = 1 - 

cGC 


( 3 ) 


Therefore, the belonging strength is measured as a real value in the range of 
[0,1] and the sum of all belonging coefficients, which is 1, is the same for all 
nodes in the network. 

Zhang et al. m proposed an extended modularity which uses the average 
of the belonging coefficients of two nodes as belonging function to measure the 
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quality of overlapping community structure: 





(4) 


where \Eir\ = i E. 

\E\ = kJ:z 
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Aij. For the case of disjoint communities, reduces exactly to 
Newman’s modularity {Q) given by Equation pj). 

Nepusz et al. [29] considered the belonging coefficient Ui^c as the probability 
that node i is active in community c. Then, the probability that node i is active 
in the same communities as node j is the dot product of their membership 
vectors, denoted as s,-,-: 


Sin - 


E 

c€C 




(5) 


The authors also adopted Sy as the similarity measure between nodes i and 
j. By replacing in Equation ([Sj) with the similarity measure Sy defined 
above, they proposed a fuzzihed variant of modularity: 
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( 6 ) 


In case communities are disjoint, there exists only one community c for every 
node i for which Oi^c = 1- Then, the fuzzihed modularity (Qoy) reduces to 
exactly the original modularity {Q) described in Equation Q. 

Shen et al. m proposed an extension of modularity for overlapping com¬ 
munity structure using Equation (jh]) but dehned the belonging coefficients of 
node i to be the reciprocal of the number of communities to which it belongs: 


^i,c 


1 


(7) 


where Oi is the number of communities containing node i. Then, the extended 
modularity for overlapping community structure is given by: 
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( 8 ) 


For disjoint community structure, Qfy reduces to the original modularity (Q) 
described in Equation ©■ 
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Shen et al. |38] proposed another extension of modularity for overlapping 
communities also using Equation ([B]). In this case, the coefficient of node i 
belonging to community c is defined as: 


^Z,C - 



(9) 


where Mik denotes the number of maximal cliques in the network containing 
edge (i, fe), is the number of maximal cliques in community c that contains 

edge (i, k), and ai is a normalization term defined as: 


(Xi — 



( 10 ) 


The maximal clique is a clique that is not a subset of any other cliques. Then, 
the extended modularity for overlapping community structure is given by: 
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( 11 ) 


Note that for disjoint communities, this new extension also reduces to New¬ 
man’s modularity shown in Equation (P|). 

Chen et al. [4] also proposed another extension of modularity with the 
same Equation but with the belonging coefficient defined as: 


^i,c 




Z)c'GCi 


Hk 


( 12 ) 


where Ct is the set of communities to which node i belongs. It measures how 
tightly node i connects to community c. Consequently, the extended definition 
of modularity for overlapping community structure is given by: 
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(13) 


Still, for disjoint community structure, reduces to the original modularity 
given by Equation ([2]). 

Unlike the node-based extensions of modularity presented above, Nicosia 
et al. [33] proposed an edge-based extension of modularity for overlapping 
communities. In this case, the belonging coefficients represent how edges are 
assigned to communities. The coefficient for edge I = (*, j) belonging to com¬ 
munity c is l3i(ij)^c = F{o-i,c,o-j,c), where F{ai^c,a'j,c) could be any function 
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(product, average, or maximum) of and Qj^c- After trying several differ¬ 
ent functions, the authors stated that the best A is a two-dimensional logistic 
function: 




1 


(14) 


where /(oi^c) is a simple linear scaling function f{x) = 2px — p,p C R. In 
papers [MIITT] . p was selected to be 30. Then, the expected belonging coeffi¬ 
cient of any edge I = (t, k) starting from node i in community c is given by 
Pi(ik)c ~ SfeGV I'unning over all nodes in the network. Accord¬ 

ingly, the expected belonging coefficient of any edge I = {k,j) pointing to 
node j in community c is defined as ^ j) c ~ ]V^ Thkev Then, the 

edge-based extension of modularity is given by: 


where 

and 


Qov 2 \E\ ^ ^ 


2\E\ 


cGC ijGc 


_ hkj 

^ijc 


EE 

c€C ijGc *- 




2\E\\ 

l^l{i,k),c^il^l{k,j),c^j 


2\E\ 


‘^ijc — — A((22^c 5 a^'^c) 


(15) 


(16) 


Sijc — Pl(i^k),cPl{k,i),c 

_ Sfcgy Pl(i,k),c X^fcgy Pl(k,j),c 

|1^|2 (17) 

_ X/fegF R{o,i,c^ 0,k,c) X/fcgy Riflk,c, ^j,c) 

- ■ 

In (5 q„, Xijc is used as the weight corresponding to the probability of the 
observed edge I = while s^c is used as the weight of the probability 

of an edge from node i to node j in the null model. Note that although for 
disjoint communities F{ai^c,o.j,c) is practically 1 when both and aj^c are 
equal to 1, Qou does not exactly reduce to the original modularity given by 
Equation @- 

Generally, there are two categories of overlapping community structures: 
crisp (non-fuzzy) overlapping and fuzzy overlapping [TS] . For crisp overlapping 
community structure, each node belongs to one or more communities but with¬ 
out the corresponding belonging coefficients. That is, the relationship between 
a node and a community is binary: a node either belongs to a community or 
it does not. For fuzzy overlapping community structure, each node can be a 
member of multiple communities, but in general the values of belonging co¬ 
efficients are different. Fuzzy overlapping can be easily transformed to crisp 
overlapping with a threshold parameter. Namely, if the belonging coefficient 
of node i to community c is larger than the value of the threshold, then node 
i stays in community c. Otherwise, node i is deleted from community c. Crisp 
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overlapping can be converted to fuzzy overlapping by calculating the value of 
the belonging coefficient using Equations ©, dH), or dni). However, calculat¬ 
ing the belonging coefficient using Equation is computationally expensive 
since it needs to find all the maximal cliques of the network first. Hence, in 
this paper we only consider Equation o and Equation m when converting 
crisp overlapping to fuzzy overlapping. 

Now, we give two general definitions, Qov and Qgy, for node-based exten¬ 
sions of modularity. First, Qov is given by: 


Qov - 
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ceC 



f2\El^\ + \Er*\y 

_ \E\ 
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and \E\ = The belonging function f{ai^c,0'j,c) can be the average 

or product of and aj^c- That is, f{ai^c,aj,c) = or f{ai^cTaj,c) = 

ai,cO'j,c- Clearly, Qov with f{ai^c,CLj,c) = is very similar to Q^v in 

Equation (|3|). Second, Q'^y is given by: 
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where f{ai^c, o-j,c) is the same as that in Equation (I18|) . It is worth noting that 
Qoy with the belonging function a,j,c) = ai,cny,c is actually the same as 

Qoy in Equation ([6l), Q^y in Equation Q^y in Equation (HU, and Q^y in 
Equation (TT^ . The only difference between these formulas is how the value of 
Ui^c is calculated. 

It is easy to prove that Qov is equivalent to Q'oy when f{ai^c, CLj,c) = ai,caj,c- 
From the definition of Qov, we know that \E™\ = ^ jec which 

is in fact the same as the first term of Q'oy Moreover, it is easy to show that 
(2|E“| -I- Hence, the second term of Qov is the 

same as the second term of Q'ov- Similarly, it can be shown that Qov is not 
equal to Q'oy when f{ai^c,aj^c) = 2 °"'° ■ 


2.3 Localized Modularity Based on Community’s Neighborhood 

Newman’s modularity is a global measure which assumes that all pairs of nodes 
have equal probability to connect with each other, which reflects the connec¬ 
tivity among all communities. However, Muff et al. |28j argued that in many 
complex networks most communities are connected to only a small fraction of 
remaining communities, called local cluster connectivity property. Thus, they 
modified the definition of Newman’s modularity by taking into account local 
cluster connectivity only to overcome global network dependency. The result¬ 
ing measure is called localized modularity. We denote the localized modularity 
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here as NQ because it is based on the neighborhood of a community. NQ for 
unweighted and undirected networks is given by: 



( 20 ) 


where is the total number of edges in the subnetwork containing the 

community c and all its neighboring communities, i.e. the neighborhood of 
community c. It means that the contribution of each community c to NQ is 
calculated based on the neighborhood of c. 

Unlike the traditional modularity (Q), the localized version of modularity 
(NQ) is not bounded above by 1. The more locally connected communities a 
network has, the bigger its NQ can grow. In a network where all communities 
are connected to each other, NQ yields the same value as Q. NQ considers 
individual communities and their neighbors, and therefore provides a measure 
of community quality that is not dependent on other parts of the network. 

Similar to the general node-based overlapping definition of modularity 
Qov in Equation ([T8l) . we extend NQ for overlapping community structure as 
NQov The formula for NQov is exactly the same with NQ in Equation (I20|) . 
while the difference is that |E*”|, and in NQov should consider 

the belonging coefficients of nodes and also the belonging function between 
pairs of nodes. For disjoint communities, NQov reduces exactly to NQ. 


3 Modularity Density 

3.1 Modularity Density for Disjoint Communities 

Chen et al. [5117115] proposed modularity density which simultaneously ad¬ 
dresses two opposite yet coexisting problems of Newman’s modularity: in some 
cases, it tends to favor small communities over large ones while in others, large 
communities over small ones. The latter tendency is known in the literature as 
the resolution limit problem of modularity [12] . Modularity density mixes two 
additional components, split penalty and the community density, into New¬ 
man’s modularity given in Equation ©• Split penalty is the fraction of edges 
that connect nodes of different communities. Community density includes in¬ 
ternal community density and pair-wise community density. The definition of 
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modularity density {Qds) for unweighted and undirected networks is given by: 
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( 21 ) 


where dc is the internal density of community c, and dc^d is the pair-wise 
density between community c and community c'. 


3.2 Modularity Density for Overlapping Communities 

According to Qov in Equation (fl^ . we extend Qds for overlapping community 
structure as: 
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where |A“| = ^ Ei,yGcai.c)Ay , = E E \Ec,d\ = 


i^c c'gC 
cQc 
jec' 


Eiec,iec' aj^d)Aij, a.nd |A| = i E The belonging function f{ai^c, aj^c) 


can be the product or average of and Qj^c- For disjoint communities, 
reduces exactly to Qds given by Equation (ED). Notice that we do not ex¬ 
tend modularity density based on Q^^ since it is too complicated and far from 
intuitive. 


4 Evaluation and Analysis 

From Subsection we know that all node-based overlapping extensions 
of modularity can be expressed with Qov in Equation m using the be¬ 
longing function f{ai^c,aj^c) = or f{ai^c,aj,c) = ai^caj,c- For the 
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edge-based overlapping extension of modularity {Qov)i the belonging func¬ 
tion is given by Equation m- Also, the overlapping extension of the lo¬ 
calized modularity NQov has the belonging function f{ai^c,cLj,c) = 
or f{ai^c,aj,c) = 0'i,caj,c- For the overlapping extension of modularity den¬ 
sity (<52"), the belonging function f{ai^c,ctj^c) can also be the average or 
the product of at^c and aj^c- Thus, there are two versions of the belonging 
function for Qov, NQov, and Therefore, we have Qov{average) with 

f{ai^c,aj^c) = ) Qoviproduct) with f{ai^c,aj^c) = ai^caj^c, Qov in Equa¬ 
tion (fT51) . NQov{average) with f{ai^c,a,j,c) = , NQov (product) with 

/(ui.c, oyc) = ai^caj,c, Q°Z(°‘'^^'^^9e) with f(ai^c, Oj^c) = ^ a-od Q^iproduct) 

with f(ai^c, a,j^c) = ai^cO-j,c- For fuzzy overlapping community structures, Ui^c is 
given for each node i to each community c to which it belongs. For crisp over¬ 
lapping community structures, we can adopt Equation o and Equation m 
to calculate Ui^c- Consequently, two versions of the belonging coefficient can 
be used to convert crisp overlapping to fuzzy overlapping. 

We also consider eight local community quality metrics: the number of 
Intra-edges, Intra-density. Contraction, the number of Boundary-edges, Expan¬ 
sion, Conductance IHIIZ], the Fitness function |24j . and the Average Modularity 
Degree [25j . These metrics describe how the connectivity structure of a given 
set of nodes resembles a community. All of them rely on the intuition that com¬ 
munities are sets of nodes with many edges inside them and few edges outside 
of them. We also extend these metrics to be applicable to fuzzy overlapping 
community structures in which nodes are assigned probability of belonging to 
each community of which they are part. Two versions of the belonging coeffi¬ 
cient and two versions of the belonging function are considered for each metric. 

For fuzzy overlapping community structure, we define the size of a community 
c as |c| = 

The number of Intra-edges (IE): it is the total number of edges 

in c. A large value of this metric is better than a small value in terms of the 
community quality. 

Intra-density (ID): dc in Equation (1^^ . The larger the value of this metric, 
the higher the quality of the communities. 

Contraction (CNT): 2|E™|/|c|; it measures the average number of edges 
per node inside the community c. A larger value of contraction means a better 
community quality. 

The number of Boundary-edges (BE): it is the total number of 

edges on the boundary of c. A small value of this metric is better than a large 
value in terms of the community quality. 

Expansion (EXP): |i?°“*|/|c|; it measures the average number of edges (per 
node) that point outside the community c. A smaller value of expansion cor¬ 
responds to a better community structure. 

Conductance (CND): 2 |g-"|+|_B°“q ’ measures the fraction of the total 
number of edges that point outside the community. A smaller value of con¬ 
ductance means a better community quality. 

The Fitness (F) function: ; it is the ratio between the internal 
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degree and the total degree of a community c. A larger value of F indicates a 
better community quality. 

Average Modularity Degree (D): X^cgC ° |^| ° it is the summation 
of the average modularity degree of each community. The average modular¬ 


ity degree of a community 


f 2\ET\-\ET^\ 
y Id 


) equals to the average inner degree 


21^‘j‘M'l I -ppout I 

( 1 ^ 1 ' ) minus the average outer degree ( ' | ° | ' ). The larger the value of D, 


the higher the quality of the community structure. 

In this section, we compare different choices of the belonging coefficient 
and the belonging function to be used for Qov, Qov^ ^Qov, <32^, and the eight 
local community quality metrics in order to see which version of the belonging 
coefficient and which version of the belonging function are better. Then, we 
try to determine which of the three overlapping extensions of modularity (two 
kinds of node-based extensions of modularity and the edge-based extension of 
modularity) is the best. In addition, we compare the performance of all these 
overlapping metrics with the best combination of belonging coefficient and 
belonging function to recommend which metrics to select when measuring the 
quality of overlapping community structures. 

The experiments are done with three community detection algorithms. 
Speaker-listener Label Propagation Algorithm (SLPA) [?D], Clique Percola¬ 
tion Method (CFinder) [35], and SpeakEasy [13] which is a label propagation 
algorithm specialized for biology networks, on a large number of real networks 
and synthetic networks. We vary the threshold parameter r of SLPA 001 from 
0.05 to 0.5 with step 0.05. SLPA gets crisp overlapping communities when 
r < 0.5 and gets disjoint communities when r = 0.5 (for r > 0.5 SLPA gener¬ 
ates the same disjoint communities as for r = 0.5). For each value of threshold 
r, we adopt 10 running samples since the community detection result of SLPA 
is not deterministic. We vary the parameter k of CFinder from 3 to 20 with 
step 1 but only when such fc-clique-community is available. It is usually the 
case that some nodes are not in the final discovered fc-clique-communities so 
we consider each of these nodes forming a community of itself. The threshold 
parameter tr of SpeakEasy is varied from 0.05 to 1 with step 0.05. SpeakEasy 
gets crisp overlapping community structures when tr < 1 and gets disjoint 
community structures when tr = 1. 

Then, for the community detection results of SLPA with different values 
of threshold r, the results of CFinder with different values of fc, and the re¬ 
sults of SpeakEasy with different thresholds tr on each of these networks, we 
calculate the values of Qov, Qov, ^Qov Q°ds, the eight local community 
quality metrics (twelve metrics in total) with two versions of the belonging 
coefficient (BC) and two versions of the belonging function (BF). For 
each r of SLPA, the values of all the metrics are calculated as the average of 
the 10 runs. For convenience, we denote Equation © and Equation CU) as 
the first and the second version of the belonging coefficient, respectively. We 

as the first version of 


c+a 7,, 


also denote the belonging function /(oqc, Oj.c) = 
the belonging function and /(oqcOj.c) = aqcOj.c as the second version of the 
belonging function. We determine which version of the belonging coefficient 
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Table 1 Basic properties of all real network datasets used in the experiments. 


Name 

#Nodes 

#Edges 

Type 

Description 

Celegans 

453 

2025 

Unweighted Sz Undirected 

Metabolic network of G. elegans 1101 

Dolphin 

62 

159 

Unweighted Undirected 

Dolphin social network 1261 

Email 

1133 

5451 

Unweighted Undirected 

Email network 1191 

Football 

115 

613 

Unweighted & Undirected 

American college football network 1151 

Jazz 

198 

2742 

Unweighted Undirected 

Jazz musicians network 1161 

Karate 

34 

78 

Unweighted Sz Undirected 

Zachary’s karate club network 1421 

Lesmis 

77 

254 

Weighted Sz Undirected 

Gharacters coappearance network 1211 

Netscience 

1461 

2742 

Weighted Sz Undirected 

Goauthorship network 1311 

PGP 

10680 

24316 

Unweighted Sz Undirected 

PGP network [2] 

Polblogs 

1224 

19022 

Unweighted Sz Directed 

Political blogs network ^ 

Polbooks 

105 

441 

Unweighted Sz Undirected 

Network of books about US politics 1221 

Railway 

297 

1213 

Unweighted Sz Undirected 

Indian railway network 

Santafe 

118 

200 

Unweighted Sz Undirected 

Gollaboration network of scientists 1151 

Collins.cyc 

1097 

6392 

Unweighted Sz Undirected 


Collins cyc w 

1097 

6392 

Weighted Sz Undirected 


Collins mips 

734 

4778 

Unweighted Sz Undirected 


Collins sgd 

809 

2955 

Unweighted Sz Undirected 

Protein-protein interaction networks |13| 

Gavin cyc 

997 

4031 

Unweighted Sz Undirected 

Gavin cyc w 

997 

4031 

Weighted Sz Undirected 


Gavin mips 

701 

2695 

Unweighted Sz Undirected 


Gavin sgd 

747 

2639 

Unweighted Sz Undirected 


Amazon 

319948 

880215 

Unweighted Sz Undirected 

Amazon product network 1411 

DBLP 

260998 

950059 

Unweighted Sz Undirected 

DBLP collaboration network 1411 


and which version of the belonging function are better based on the largest 
number of quality metrics consistent with each other on determining the best 
value of threshold r for SLPA, the best value of parameter k for CFinder, 
and the best value of threshold tr for SpeakEasy on all these networks. Fi¬ 
nally, we compare the performance of these overlapping metrics with the best 
combination of belonging coefficient and belonging function by looking at how 
many times each metric is among those that are consistent with each other on 
determining the best values of parameters for the three adopted community 
detection algorithms. 


4.1 Real Network Datasets 

We consider totally 23 real network datasets, including friendship network, col¬ 
laboration networks, co-purchasing networks, biology networks, etc. Table [T] 
shows the basic properties of all these datasets. It can be seen that these net¬ 
works have different numbers of nodes and different numbers of edges, varying 
from very small networks to very large networks. Moreover, edges in some 
networks have weights and directions. 

Celegans is a metabolic network of C. elegans [TU]. Dolphin is an so¬ 
cial network of frequent associations between 62 dolphins in a community 
living off Doubtful Sound, New Zealand [5^. Email is a network of email 
interchanges between members of the Univeristy Rovira i Virgili (Tarrag- 
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ona) [TO]. Football is a network that represents the schedule of games be¬ 
tween college football teams in a single season m- Jazz is a network of 
collaborations between jazz musicians |16j . Karate is a network represent¬ 
ing the friendships between 34 members of a karate club at a US university 
during two years [TO]- Lesmis is a coappearairce network of characters in 
the novel Les Miserables m- Netscience is a coauthorship network of sci¬ 
entists working on network theory aird experiment [3I]- PGP is the giant 
component of the network of users of the Pretty-Good-Privacy (PGP) algo¬ 
rithm for secure information interchange (2). Polblogs is a directed network 
of hyperlinks between weblogs on US politics, recorded in 2005 by Adamic 
and Glance [T]. Polbooks is a network of books about US politics published 
around the time of the 2004 presidential election and sold by the online book¬ 
seller Amazon.com |22j . Railway is a network with nodes representing In¬ 
dian railway stations, where two stations are connected by an edge if there 
exists at least one train-route such that both stations are scheduled stops 
on that route |5|. Santafe is the largest connected component of the col¬ 
laboration network of scientists at the Santa Fe Institute during years 1999 
and 2000 [TO]. Collins_cyc, Collins_cyc_w, Collins_mips, Collins_sgd, 
Gavin_cyc, Gavin_cyc_w, Gavin_mips, and Gavin_sgd are two kinds (re¬ 
ferred as Gollins [S] and Gavin [TO] here) of popular high throughput protein- 
protein interaction networks derived from measurements obtained by affinity 
purihcation and mass spectrometry (AP-MS) techniques |13| . These two kinds 
of networks are further refined with three gold-standards for protein complexes, 
including the classic Munich Information Genter for Protein Sequences (MIPS) 
m and the more recent Saccharomyces Genome Database (SGD) [20]. The 
complete MIPS dataset as well as partial information from SGD are incor¬ 
porated into a third protein complex list known as GYG2008 [30]. Thus, we 
have Collins_cyc, Collins_mips, Collins_sgd, Gavin_cyc, Gavin_mips, 
and Gavin_sgd, respectively. Gollins_cyc_w and Gavin_cyc_w are respec¬ 
tively the weighted versions of Collins_cyc and Gavin_cyc, in which the 
weight is proportional to the probability a given interaction pair truly exists. 
Amazon is a product co-purchasing network of the Amazon website [JT] . The 
nodes of the network represent products and edges link commonly copurchased 
products. DBLP is a scientihc collaboration network where nodes represent 
authors and edges connect authors that have co-authored a paper [41]. 

Tables |2]|4] show the best value of threshold r for SLPA, the best value of 
parameter k for GFinder, and the best value of threshold tr for SpeakEasy, 
respectively, along with the corresponding number of community quality met¬ 
rics (out of twelve) that are consistent with each other on determining this 
best r, this best k, and this best tr for the four combinations of two versions 
of belonging coefficient and two versions of belonging function on all 23 real 
network datasets. For instance, entry 0.5 (5) in row Dolphin and column 
(BC,BF) = (1,1) in Table [2] means that there are totally five out of twelve 
community quality metrics showing that the best value of threshold r for SLPA 
is 0.5 when adopting the first version of belonging coefficient and the first ver¬ 
sion of belonging function. The red italic font in each dataset row of these 
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Table 2 The best value of threshold r for SLPA and the corresponding number of commu¬ 
nity quality metrics (out of twelve) that are consistent with each other on determining this 
best r for the four combinations of two versions of belonging coefficient and two versions of 
belonging function on all real network datasets. The best value in each row is marked by 
red italic font. 



SLPA 

Datasets 

(BC,BF)=(1,1) 

(BC,BF)=(1,2) 

(BC,BF)=(2,1) 

(BC,BF)=(2,2) 

Celegans 

0.5 (6) 

0.05 (4) 

0.5 (6) 

0.4 (5) 

Dolphin 

0.5 (5) 

0.4 (8) 

0.5 (6) 

{0.05,0.4} (5) 

Email 

0.5 (8) 

0.5 (10) 

0.5 (7) 

0.5 (8) 

Football 

{0.45,0.5} (9) 

{0.45,0.5} (11) 

{0.45,0.5} (8) 

0.25 (7) 

Jazz 

0.5 (10) 

0.5 (10) 

0.5 (9) 

0.5 (9) 

Karate 

0.5 (8) 

0.45 (10) 

0.5 (7) 

0.45 (10) 

Lesmis 

{0.25,0.5} (4) 

0.25 (6) 

0.5 (4) 

0.15 (5) 

Netscience 

0.35 (4) 

0.5 (4) 

0.35 (5) 

{0.35,0.5} (3) 

PGP 

0.5 (8) 

0.5 (10) 

0.5 (7) 

0.5 (8) 

Polblogs 

0.5 (7) 

0.5 (8) 

0.5 (6) 

0.5 (4) 

Polbooks 

0.5 (7) 

0.2 (5) 

0.5 (7) 

0.2 (4) 

Railway 

0.5 (8) 

0.5 (9) 

0.5 (7) 

0.5 (7) 

Santafe 

0.4 (6) 

0-4 (7) 

0.4 (5) 

0.4 (5) 

Collins.cyc 

0.5 (8) 

0.5 (5) 

0.5 (8) 

0.5 (6) 

Collins cyc w 

0.05 (7) 

0.05 (7) 

0.05 (6) 

0.05 (8) 

Collins mips 

0.45 (5) 

0.45 (6) 

{0.05,0.45} 4 

{0.05,0.45} (4) 

Collins sgd 

0.5 (4) 

0.1 (4) 

0.5 (4) 

0.1 (6) 

Gavin.cyc 

0.45 (5) 

0.45 (6) 

0.45 (5) 

0.45 (7) 

Gavin cyc w 

0.35 (5) 

0.3 (6) 

0.05 (4) 

0.3 (5) 

Gavin mips 

0.5 (9) 

0.5 (11) 

0.5 (8) 

0.5 (6) 

Gavin sgd 

0.5 (8) 

0.5 (9) 

0.5 (7) 

0.5 (7) 

Amazon 

0.5 (8) 

0.5 (10) 

0.5 (6) 

0.5 (9) 

DBLP 

0.5 (7) 

0.5 (10) 

0.5 (6) 

0.5 (8) 


tables denotes the best combination of the two versions of belonging coeffi¬ 
cient and two versions of belonging function for each dataset. For example, 
0.4 (8) in row Dolphin and column (BC,Br) = (l,2) in Table [2] indicates 
that the twelve metrics with the first version of belonging coefficient and the 
second version of belonging function is the best since the number of metrics 
(which is eight here) consistent with each other on determining the best value 
of r is the largest among the four combinations. 

It can be observed from Table [2] that almost all the networks, except Cel- 
egans, Netscience, Polbooks, Collins_cyc, Collins_cyc_w, Collins_sgd, 
and Gavin_cyc, imply that (BC,BF)=(1,2) is the best among the four possi¬ 
ble combinations when using SLPA. Table [3] shows that (BC,BF)=(1,2) is the 
best on all the networks, except Football, when using CFinder. Results for 
CFinder on Polblogs, Collins_cyc_w, and Gavin_cyc_w are not provided 
because it has not Hnished running on these three networks for more than 
two months processing many potential /c-cliques. Similarly, Table 0] indicates 
that (BC,BF)=(1,2) is the best on all the networks, except Jazz, when using 
SpeakEasy. We can observe from the three tables that for each network there 
are at least two out of three algorithms (SLPA, CFinder, and SpeakEasy) sup- 
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Table 3 The best value of parameter k for CFinder and the corresponding number of 
community quality metrics (out of twelve) that are consistent with each other on determining 
this best k for the four combinations of two versions of belonging coefficient and two versions 
of belonging function on all real network datasets. The best value in each row is marked by 
red italic font. 



CFinder 

Datasets 

(BC,BF)=(1,1) 

(BC,BF)=(1,2) 

(BC,BF)=(2,1) 

(BC,BF)=(2,2) 

Celegans 

3 (6) 

3 (7) 

3 (5) 

{3,9} (4) 

Dolphin 

3 (11) 

3 (12) 

3 (7) 

3 (7) 

Email 

3 (10) 

3 (10) 

4 (4) 

{3,4,9-12} (3) 

Football 

i (6) 

{3,4} (5) 

4 (5) 

4 (5) 

Jazz 

3 (8) 

3 (8) 

3 (6) 

3 (5) 

Karate 

3 (11) 

3 (11) 

3 (8) 

3 (7) 

Lesmis 

3 (7) 

3 (8) 

6 (4) 

{3,6} (4) 

Netscience 

3 (11) 

3 (12) 

3 (9) 

3 (9) 

PGP 

3 (10) 

3 (12) 

3 (6) 

3 (8) 

Polblogs 

N/A 

N/A 

N/A 

N/A 

Polbooks 

3 (10) 

3 (10) 

3 (7) 

3 (6) 

Railway 

3 (9) 

3 (9) 

3 (4) 

3 (4) 

Santafe 

3 (10) 

3 (11) 

3 (7) 

3 (8) 

Collins.cyc 

3 (11) 

3 (11) 

3 (7) 

3 (8) 

Collins cyc w 

N/A 

N/A 

N/A 

N/A 

Collins mips 

3 (10) 

3 (10) 

3 (6) 

3 (6) 

Collins sgd 

3 (12) 

3 (12) 

3 (8) 

3 (8) 

Gavin.cyc 

3 (11) 

3 (12) 

3 (6) 

3 (6) 

Gavin cyc w 

N/A 

N/A 

N/A 

N/A 

Gavin mips 

3 (12) 

3 (12) 

3 (7) 

3 (6) 

Gavin sgd 

3 (11) 

3 (11) 

3 (6) 

3 (6) 

Amazon 

3 (11) 

3 (11) 

3 (6) 

3 (6) 

DBLP 

3 (9) 

3 (10) 

3 (5) 

3 (5) 


porting conclusion that (BC,BF)=(1,2) is the best. Thus, we determined that 
the first version of the belonging coefficient is better than the alternative. It 
means that to convert crisp overlapping to fuzzy overlapping the belonging 
coefficient of a node to a community should be the reciprocal of the number 
of communities of which this node is a part. When the relationship between 
a node and the communities to which it belongs is binary, there is no infor¬ 
mation about the strength of the membership. In this case, it is intuitive and 
reasonable to assign a node to its communities using equal belonging coeffi¬ 
cients. We also determined that the second version of the belonging function 
is better than the alternative. It means that the probability of the event that 
two nodes belong to the same community should be the product, not the aver¬ 
age, of their belonging coefficients to that community. In addition, 
when /(oi^c) = cii^cCij,c as proved in Subsection 12.21 which is another way 

of showing that the second version of the belonging function is much more 
suitable for use in the metric than the first. Therefore, we conclude that the 
overlapping community quality metrics with the first version of the belonging 
coefficient and the second version of the belonging function are the best among 
the four possible combinations on all the real network datasets. Tables 1-23 in 
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Table 4 The best value of threshold tr for SpeakEasy and the corresponding number of 
community quality metrics (out of twelve) that are consistent with each other on determining 
this best tr for the four combinations of two versions of belonging coefficient and two versions 
of belonging function on all real network datasets. The best value in each row is marked by 
red italic font. 



SpeakEasy 

Datasets 

(BC,BF)=(1,1) 

(BC,BF)=(1,2) 

(BC,BF)=(2,1) 

(BC,BF)=(2,2) 

Cclegans 

0.75 (6) 

0.75 (7) 

0.75 (6) 

0.75 (6) 

Dolphin 

0.4 (3) 

0.4 (4) 

0.15 {4) 

0.7 {4) 

Email 

0.9 (3) 

1 (3) 

{0.05,0.9} (2) 

{0.5,1} {3) 

Football 

0.6 {10) 

0.6 {10) 

0.6 {10) 

0.6 {10) 

Jazz 

0.75 (5) 

0.75 (5) 

0.75 (5) 

0.75 {6) 

Karate 

0.45 (5) 

0.45 {6) 

0.45 (5) 

0.45 {6) 

Lesmis 

0.85 (5) 

0.85 {6) 

0.85 (4) 

0.85 (4) 

Netscience 

0.7 (4) 

0.25 {10) 

0.05 (3) 

{0.15,0.2,0.25,0.7} (2) 

PGP 

0.85 (5) 

0.85 (7) 

{0.05,0.75,0.85} (3) 

0.85 (4) 

Polblogs 

0.7 U) 

0.7 {4) 

0.45 (3) 

{0.5,0.7,0.8,0.9} (2) 

Polbooks 

0.95 (5) 

0.95 {6) 

{0.5,0.95} (3) 

0.95 (4) 

Railway 

0.8 (4) 

0.8 {5) 

0.8 (3) 

0.8 (4) 

Santafe 

0.9 (5) 

0.9 {5) 

0.9 (4) 

{0.65,0.9} (4) 

Collins.cyc 

0.9 (8) 

0.9 {10) 

0.9 (4) 

0.9 (6) 

Collins cyc w 

0.55 (6) 

0.55 {9) 

{0.1,0.55} (3) 

0.55 (4) 

Collins mips 

0.4 (7) 

0.4 { 10 ) 

0.4 (5) 

{0.25,0.4,0.6} (3) 

Collins sgd 

0.8 (7) 

0.8 {10) 

{0.5,0.8} (4) 

{0.5,0.8} (5) 

Gavin.cyc 

0.7 (6) 

0.7 {8) 

0.7 (5) 

0.7 (6) 

Gavin cyc w 

0.7 (4) 

0.7 {5) 

{0.05,0.5,0.7,0.95} (2) 

0.7 (3) 

Gavin mips 

0.9 (6) 

0.9 {8) 

0.9 (5) 

0.9 (5) 

Gavin sgd 

0.8 (5) 

0.8 {6) 

0.8 {6) 

0.8 {6) 

Amazon 

1 (9) 

1 {11) 

1 (8) 

1 {11) 

DBLP 

1 (9) 

1 {10) 

1 (8) 

1 {10) 


Supplementary Materials contain more detailed results on these real network 
datasets. 

Table [ 5 ] shows the number of times (maximum three since there are totally 
three community detection algorithms adopted) that the community quality 
metric is among the quality metrics that are consistent with each other on 
determining the best value of threshold r for SLPA, the best value of param¬ 
eter k for CFinder, and the best value of threshold tr for SpeakEasy on each 
real network with (BC,BF)=(1,2). The last row in the table presents on how 
many networks this metric got support from all three algorithms and how 
many times on average (maximum three; but for Polblogs, Collins_cyc_w, 
and Gavin_cyc_w the maximum value is two because CFinder has no result 
for these networks) this metric got support from each of the 23 real networks. 
This table indicates that the edge-based overlapping definition is the best 
overlapping extension for modularity among the three extensions (two kinds 
of node-based extensions of modularity and the edge-based extension of modu¬ 
larity) . The fitness function gets the largest values in the last row and generally 
local metrics are better than global metrics here which may imply that local 
community quality metrics are more applicable for measuring the quality of 




































18 


Mingming Chen, Boleslaw K. Szymanski 


Table 5 The number of times (maximum three since there are totally three community 
detection algorithms adopted) that the community quality metric is among those consistent 
with each other on determining the best value of threshold r for SLPA, the best value of 
parameter k for CFinder, and the best value of threshold tr for SpeakEasy on each real 
network with (BC,BF)=(1,2). The best value in the last row is marked by red italic font. 


Datasets 

Qov 

NQov 

^OV 

qd: 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 

Celegans 

1 

1 

1 

1 

3 

0 

2 

0 

2 

2 

3 

2 

Dolphin 

3 

2 

3 

2 

2 

2 

1 

1 

2 

2 

1 

3 

Email 

2 

2 

3 

2 

3 

1 

1 

1 

2 

2 

2 

2 

Football 

2 

1 

3 

3 

3 

2 

3 

2 

3 

3 

3 

3 

Jazz 

2 

2 

2 

2 

2 

1 

1 

3 

2 

2 

2 

2 

Karate 

3 

3 

3 

3 

2 

1 

2 

1 

3 

2 

2 

2 

Lesmis 

0 

2 

1 

0 

2 

2 

3 

0 

2 

3 

3 

2 

Netscience 

3 

2 

3 

2 

3 

1 

1 

3 

2 

2 

2 

2 

PGP 

3 

2 

3 

2 

2 

1 

3 

2 

3 

3 

2 

3 

Polblogs 

1 

1 

1 

1 

1 

0 

1 

1 

1 

1 

1 

2 

Polbooks 

2 

1 

2 

1 

2 

2 

1 

0 

2 

3 

3 

2 

Railway 

1 

1 

1 

1 

2 

1 

3 

2 

2 

3 

3 

3 

Santafe 

2 

2 

2 

1 

1 

2 

3 

0 

2 

3 

3 

2 

Collins.cyc 

3 

2 

3 

1 

2 

3 

2 

2 

2 

2 

2 

2 

Collins cyc w 

2 

1 

2 

0 

2 

0 

1 

1 

2 

2 

2 

1 

Collins mips 

2 

3 

3 

1 

2 

2 

2 

1 

3 

2 

2 

3 

Collins^gd 

2 

2 

2 

1 

3 

2 

3 

1 

2 

3 

3 

2 

Gavin.cyc 

2 

2 

2 

2 

2 

1 

1 

2 

3 

3 

3 

3 

Gavin cyc w 

0 

0 

0 

1 

1 

0 

2 

0 

1 

2 

2 

2 

Gavin mips 

3 

2 

3 

2 

2 

2 

3 

2 

3 

3 

3 

3 

Gavin sgd 

2 

3 

2 

1 

2 

2 

3 

1 

2 

3 

3 

2 

Amazon 

3 

3 

3 

3 

3 

2 

3 

1 

3 

2 

3 

3 

DBLP 

2 

2 

3 

3 

3 

2 

2 

1 

3 

3 

3 

3 


8 

(2) 

4 

(1.83) 

12 

(2.22) 

4 

(1.57) 

8 

(2.17) 

1 

(1.39) 

10 

(2.04) 

2 

(1.22) 

9 

(2.26) 

13 

(2.43) 

U 

(2.43) 

11 

(2.35) 


overlapping community structures. Note that eight out of twelve metrics are 
local metrics, so they are majority here which may impact the selection of the 
most consistent metrics. In addition, even though we have already shown in [51 
nm that modularity density simultaneously solves two opposite yet coexisting 
problems of modularity, is not highly consistent with majority of other 
metrics. This might be because only one other metric consider community 
density. We will investigate if adding more community density based metrics 
can change the above results. 


4.2 LFR Benchmark Networks 

LFR (named after the initials of names of authors) benchmark networks [23] 
have become a standard in the evaluation of the performance of community 
detection algorithms. The LFR benchmark network that we used here has 
1000 nodes with average degree 15 and maximum degree 50. The exponent 7 
for the degree sequence varies from 2 to 3. The exponent P for the commu¬ 
nity size distribution ranges from 1 to 2. Then, four pairs of the exponents 
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Table 6 The best value of threshold r for SLPA and the corresponding number of commu¬ 
nity quality metrics (out of twelve) that are consistent with each other on determining this 
best r for the four combinations of two versions of belonging coefficient and two versions of 
belonging function on LFR benchmark networks with (q:,/ 3) = (1,2) and (i = 0.3,0.35,0.4. 
The best value in each row is marked by red italic font. 



SLPA 


Om 

(BC,BF)=(1,1) 

(BC,BF)=(1,2) 

(BC,BF)=(2,1) 

(BC,BF)=(2,2) 

0.3 

1 

0.5 (7) 

0.5 (8) 

0.5 (7) 

0.5 (6) 

2 

0.5 (8) 

0.5 (5) 

0X(7) 

074 ] 

4 

0.5 (9) 

0.5 (10) 

0.5 (8) 

0.5 (5) 

6 

0.5 (9) 

0.5 {10) 

0.5 (8) 

0.5 (6) 

8 

0.5 (9) 

0.5 (10) 

0.5 (8) 

0.5 (9) 

0.35 

1 

0.25 (5) 

0.25 (6) 

0.25 (4) 

0.25 (5) 

2 

mrjT) 

0.45 (7) 

mrjT) 

0.45 (6) 

4 

0.5 (8) 

0.5 (9) 

0177 ) 

0.5 (6) 

6 

0.5 (8) 

0.5 (9) 

0177 ) 

0.5 (8) 

8 

oXTt] 

0.5 (9) 

0.5 (6) 

0.5 (8) 

0.4 

1 

0.5 (6) 

0.25 (5) 

0.5 (4) 

{0.2,0.25,0.45} (3) 

2 

0.5 (8) 

0.5 (9) 

0177 ) 

0.5 (5) 

4 

0.5 (9) 

0.5 (10) 

0.5 (8) 

0.5 (8) 

6 

0.5 (9) 

0.5 (10) 

0.5 (9) 

0.5 (9) 

8 

0.5 (8) 

0.5 (9) 

0177 ) 

0.5 (8) 


( 7 , /3) = (2,1), (2, 2), (3,1), and (3, 2) are chosen in order to explore the widest 
spectrum of graph structures. The mixing parameter /i is varied from 0.05 to 
0.95. It means that each node shares a fraction (1 — /i) of its edges with the 
other nodes in its community and shares a fraction ^ of its edges with the 
nodes outside its community. Thus, low mixing parameters indicate strong 
community structure. The degree of overlap is determined by two parameters. 
On is the number of overlapping nodes, and Om is the number of commu¬ 
nities to which each overlapping node belongs. 0 „ here is set to 10 % of the 
total number of nodes. Instead of fixing Om, we allow it to vary from 1 to 8 
indicating the overlapping diversity of overlapping nodes. By increasing the 
value of Om, we create harder detection tasks. Also, we generate 10 network 
instances for each configuration of these parameters. Hence, each metric value 
for a certain configuration of LFR represents the average metric values of all 
10 instances. Since the experimental results are similar for all four pairs of ex¬ 
ponents ( 7 , P) = (2,1), (2, 2), (3,1), and (3, 2), for the sake of brevity, we only 
present the results for ( 7 , /3) = (2,1) here. In addition, these results are similar 
for different values of ^ and we cannot show all the results in one page as can be 
observed from Tables 24-26 in the Supplementary Materials, so here we only 
show the results for /r = 0.3,0.35, and 0.4. We choose fi = 0.3,0.35, and 0.4 
to better illustrate the results since with p, = 0.3,0.35, and 0.4 the commu¬ 
nity structures generated by LFR are around the boundary of well-separated 
communities and well-connected communities. For each node, ^ = 0.5 means 
that the number of its edges with other nodes in its communities is equal to 
the number of its edges with nodes outside its community, which makes the 
community structure difficult to discover. 
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Table 7 The best value of parameter k for CFinder and the corresponding number of 
community quality metrics (out of twelve) that are consistent with each other on determining 
this best k for the four combinations of two versions of belonging coefficient and two versions 
of belonging function on LFR benchmark networks with (q, f3) = (1, 2) and fi = 0.3, 0.35, 0.4. 
The best value in each row is marked by red italic font. 



CFinder 


Om 

(BC,BF)=(1,1) 

(BC,BF)=(1,2) 

(BC,BF)=(2,1) 

(BC,BF)=(2,2) 

0.3 

1 

4 (9) 

4 (9) 

4 (7) 

4 (6) 

2 

{3,4} (5) 

FTe) 

F(5) 

4T5) 

4 

376 ) 

Fie) 

F(4) 

r(4) 

6 

315) 

TVr) 

{3,4,11} (3) 

TD 

8 

315) 

TVr) 

{3,4,12} (3) 

TD 

0.35 

1 

4 (8) 

4 (8) 

4 (6) 

4 (5) 

2 


Fie) 

{3,8} (4) 

TD 

4 

3Uj 

TW 

F(4) 

TD 

6 

4T5) 

TJt) 

4T5) 

4T5) 

8 

4T6) 

TVr) 

4T4) 

4T5) 

0.4 

1 

4 (6) 

3 (e) 

{3,4,8} (4) 

4(4) 

2 

Fie) 

Fie) 

{3,9} (4) 

r(4) 

4 

Fie) 

TW) 

{3,9} (4) 

4T4) 

6 

ri) 

TW) 

4T5) 

4T5) 

8 

4T6) 

TVf) 

4T5) 

4T5) 


Table 8 The best value of threshold tr for SpeakEasy and the corresponding number of 
quality metrics (out of twelve) that are consistent with each other on determining this best 
tr for the four combinations of two versions of belonging coefficient and two versions of 
belonging function on LFR benchmark networks with = (1,2) and n = 0.3,0.35,0.4. 

The best value in each row is marked by red italic font. 



SpeakEasy 


Om 

(BC,BF)=(1,1) 

(BC,BF)=(1,2) 

(BC,BF)=(2,1) 

(BC,BF)=(2,2) 

0.3 

1 

0.75 (0) 

0.75 (9) 

0.75 (9) 

0.75 (9) 

2 

0.8 (7) 

0.8 (8) 

0.8 (6) 

0.8 (7) 

4 

fid 

fid 

fid 

fid 

6 

0.95 (4) 

0.95 (4) 

0.05 (3) 

{0.35,0.7} (4) 

8 

0.05 (3) 

{0.4,0.6} (4) 

{0.05,0.85} (3) 

{0.4,0.85} (3) 

0.35 

1 

0.8 (9) 

0.8 (9) 

0.8 (9) 

0.8 (9) 

2 

0.95 (5) 

0.95 (6) 

0.95 (4) 

{0.35,0.95} (3) 

4 

{0.05,0.95} (3) 

0.75 (4) 

{0.05,0.95} (3) 

0.95 (3) 

6 

0.85 (5) 

0.85 (4) 

0.85 (4) 

0.45 (3) 

8 

{0.05,0.87 (3) 

0.9 (5) 

{0.05,0.85,1} (3) 

{0.85,0.9} (3) 

0.4 

1 

0.15 (9) 

0.15 (9) 

0.15 (9) 

0.15 (9) 

2 

{0.8,0.85,0.95} (3) 

0.95 (4) 

{0.05,0.85,0.95} (3) 

0.95 (5) 

4 

0.95 (5) 

0.95 (e) 

0.95 (4) 

0.95 (4) 

6 

0.95 (4) 

{0.65,0.75} (3) 

0.75 (4) 

0.75 (6) 

8 

{0.05,0.9,1} (3) 

0.4 (5) 

{0.05,0.95,1} (3) 

{0.4,0.5} (3) 


Tables [6][8] show the best value of threshold r for SLPA, the best value of 
parameter k for CFinder, and the best value of threshold tr for SpeakEasy, 
respectively, along with the corresponding number of community quality met¬ 
rics (out of twelve) that are consistent with each other on determining this 
best r, this best fc, and this best tr for the four possible combinations of two 
versions of belonging coefficient and two versions of belonging function on LFR 
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Table 9 The number of times (maximum three since there are totally three community 
detection algorithms adopted) that the community quality metric is among those consistent 
with each other on determining the best value of threshold r for SLPA, the best value 
of parameter k for CFinder, and the best value of threshold tr for SpeakEasy on each 
configuration of LFR benchmark networks with (a,/3) = (1,2) and n = 0.3,0.35,0.4 when 
(BC,BF)=(1,2). The best value in the last row is marked by red italic font. 



Om 

Qov 

NQov 

^OV 

QT^ 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 

0.3 

1 

3 

2 

3 

3 

1 

0 

2 

0 

3 

3 

3 

3 

2 

3 

1 

3 

2 

1 

1 

2 

0 

2 

1 

1 

2 

4 

1 

2 

1 

1 

1 

3 

2 

2 

1 

2 

2 

2 

6 

2 

1 

2 

2 

0 

1 

2 

0 

3 

3 

3 

2 

8 

3 

1 

3 

2 

1 

1 

3 

0 

3 

3 

3 

2 

0.35 

1 

3 

1 

3 

2 

1 

0 

2 

0 

3 

2 

3 

3 

2 

2 

0 

2 

1 

1 

2 

2 

1 

1 

2 

2 

3 

4 

2 

2 

2 

2 

0 

0 

3 

0 

2 

2 

2 

2 

6 

2 

1 

2 

2 

0 

0 

2 

0 

3 

3 

3 

2 

8 

3 

1 

3 

2 

1 

0 

2 

0 

3 

2 

2 

2 

0.4 

1 

1 

0 

1 

1 

2 

1 

3 

0 

2 

3 

3 

3 

2 

2 

1 

2 

1 

1 

2 

2 

0 

1 

2 

2 

3 

4 

3 

2 

3 

2 

0 

1 

3 

0 

2 

2 

2 

2 

6 

3 

2 

3 

3 

0 

1 

3 

0 

2 

3 

3 

1 

8 

2 

1 

2 

2 

1 

0 

3 

0 

2 

3 

3 

2 


7 

(2.33) 

0 

(1.2) 

7 

(2.33) 

2 

(1.87) 

0 

(0.73) 

1 

(0.87) 

6 

(2.4) 

0 

(0.2) 

6 

(2.2) 

7 

(2.4) 

8 

{2.47) 

5 

(2.27) 


benchmark networks with (a, /3) = (1, 2) and /r = 0.3, 0.35, 0.4. Table [5] implies 
that (BC,BF)=(1,2) is the best among the four possible combinations on all 
configurations of LFR networks except /r = 0.3, Om = 2 and /i = 0.4, Om = 1 
when usiirg SLPA. Table [7] demonstrates that (BC,BF)=(1,2) is the best on all 
configurations of LFR networks when using CFinder. Table |8] indicates that 
(BC,BF) = (1,2) is the best among the four combinations on all configurations 
of LFR networks except /r = 0.35, Om = 6 and = 0.4, Om = 2,4 when using 
SpeakEasy. Consequently, we could conclude that the overlapping community 
quality metrics with the first version of belonging coefficient and the second 
version of the belongiirg function are the best among the four possible com¬ 
binations on LFR networks. Please refer to Tables 24-26 in Supplementary 
Materials for more detailed results on LFR benchmark networks. 

Table [9] shows the number of times (maximum three since there are totally 
three community detection algorithms adopted) that the community quality 
metric is among the metrics that are consistent with each other on determin¬ 
ing the best value of threshold r for SLPA, the best value of parameter k for 
CFinder, and the best value of threshold tr for SpeakEasy on each configu¬ 
ration of LER benchmark networks with (a, /3) = (1, 2) and /r = 0.3, 0.35, 0.4 
when (BC,BE)=(1,2). The last row in the table presents how many different 
configurations of LER networks support this metric across all three algorithms 
and how many times on average (maximum three) this metric got support from 
each of the 15 different configurations. This table shows results similar with 
those presented in Tabled Node-based overlapping extension and edge-based 
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overlapping extension of modularity perform equally well on LFR networks. 
The last row shows that the fitness function has the largest values and gener¬ 
ally local metrics are better than global metrics, and still performs poor. 
The reason is the same as given for the results in Table [SJ 


5 Conclusion 

In this paper, we determined which versions of the belonging coefficient and 
the belonging function are better for measuring quality of fuzzy overlapping 
community structures. We found that the first version of the belonging coeffi¬ 
cient is better than the second one, which means that the coefficient of a node 
belonging to a community should be the reciprocal of the number of communi¬ 
ties to which this node belongs. In addition, we found that the second version 
of the belonging function is better than the first version, meaning that the 
probability that two nodes belong to the same community should be the prod¬ 
uct, not the average, of their belonging coefficients. Moreover, we proposed 
overlapping extensions for localized modularity, modularity density, and eight 
local community quality metrics analogous to such extension of modularity. 
Based on the experimental results, we recommend using the edge-based over¬ 
lapping extension of modularity with the first version of belonging coefficient 
and with its own belonging function. We also recommend using the node-based 
overlapping extension of modularity and overlapping extension of modularity 
density with the first version of belonging coefficient and the second version 
of belonging function as the metrics of the global quality of overlapping com¬ 
munity structures. 

In the future, we plan to explore local community quality metrics for over¬ 
lapping community structures and investigate more community quality metrics 
incorporating community density to see whether putting community density 
into these metrics will make them perform better. 
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1 Real Network Datasets 

1.1 C. elegans Metabolic Network 

This is the metabolic network of C. elegans [5] with 453 nodes and 2025 edges. Table [T] shows the 
best value of threshold r for SLPA EH, the best value of parameter k for CFinder [mg, and the 
best value of threshold tr for SpeakEasy |7] determined by the twelve community quality metrics with 
four possible combinations of the two versions of belonging coefficient and two version of belonging 
function for this network. The last column in this table (and all the following tables) is the best value of 
threshold r for SLPA, the best value of parameter k for CFinder, or the best value of the threshold tr for 
SpeakEasy along with the corresponding number of community quality metrics (out of twelve) that are 
consistent with each other on determining this best r, this best k, and this best tr for each combination of 
belonging coefficient and belonging function. The table shows that the first version of belonging function 
is better than the second version of belonging function when using SLPA. For CFinder and SpeakEasy, it 
implies that (BC,BF)=(1,2) is the best among the four possible combinations of two versions of belonging 
coefficient and two versions of belonging function. In conclusion, two out of three algorithms show that 
(BC,BF)=(1,2) is the best on C. elegans metabolic network. 


Table 1. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on C. elegans metabolic network. 
The best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.5 

0.3 

0.3 

0.35 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.05 

0.5 

0.5 (6) 

SLPA 

(1,2) 

0.3 

0.3 

0.3 

0.4 

0.05 

0.5 

0.05 

0.5 

0.4 

0.05 

0.05 

0.4 

0.05 (4) 

(2,1) 

0.5 

0.3 

0.45 

0.4 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.05 

0.5 

0.5 (6) 


(2,2) 

0.4 

0.4 

0.45 

0.4 

0.05 

0.35 

0.05 

0.35 

0.4 

0.05 

0.05 

0.4 

0.4 (5) 


(1,1) 

9 

3 

4 

3 

3 

4 

4 

7 

7 

3 

3 

3 

3 (6) 

CFinder 

(1,2) 

4 

3 

4 

4 

3 

4 

3 

7 

3 

3 

3 

3 

3 (7) 

(2,1) 

7 

3 

4 

3 

3 

4 

4 

9 

9 

3 

3 

9 

3 (5) 


(2,2) 

5 

9 

4 

4 

3 

4 

3 

9 

9 

9 

3 

3 

{3,9} (4) 


(1,1) 

0.75 

0.8 

0.75 

0.75 

0.05 

0.8 

0.05 

1 

0.75 

0.9 

0.75 

0.75 

0.75 (6) 

SpeakEasy 

(1,2) 

0.75 

0.8 

0.75 

0.75 

0.75 

0.8 

0.9 

0.35 

0.75 

0.9 

0.75 

0.75 

0.75 (7) 

(2,1) 

0.75 

0.8 

0.75 

0.75 

0.05 

0.8 

0.05 

1 

0.75 

0.9 

0.75 

0.75 

0.75 (6) 


(2,2) 

0.75 

0.8 

0.75 

0.75 

0.75 

0.8 

0.9 

0.35 

0.75 

0.5 

0.9 

0.75 

0.75 (6) 


1.2 Dolphin Social Network 

This is a social network of frequent associations between 62 dolphins in a community living off Doubtful 
Sound, New Zealand m- There are 62 nodes and 159 edges. Table E] shows the best value of threshold r 
for SLPA, the best value of parameter k for CFinder, and the best value of threshold tr for SpeakEasy 
determined by the twelve community quality metrics with four possible combinations of the two versions 
of belonging coefficient and two version of belonging function on dolphin social network. We could learn 
from the table that all the three algorithms demonstrate that (BC,BF)=(1,2) is the best among the four 
combinations of belonging coefficient and belonging function. 


































Table 2. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on dolphin social network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1.1) 

0.5 

0.4 

0.4 

0.4 

0.05 

0.5 

0.05 

0.5 

0.5 

0.45 

0.5 

0.45 

0.5 (5) 

SLPA 

(1.2) 

0.4 

0.4 

0.4 

0.4 

0.05 

0.4 

0.05 

0.5 

0.4 

0.4 

0.05 

0.4 

0.4 (8) 

(2.1) 

0.5 

0.4 

0.05 

0.4 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.45 

0.5 (6) 


(2.2) 

0.4 

0.4 

0.05 

0.4 

0.05 

0.3 

0.05 

0.3 

0.4 

0.05 

0.05 

0.4 

{0.05,0.4} (5) 


(1.1) 

3 

3 

3 

3 

3 

3 

3 

4 

3 

3 

3 

3 

3 (11) 

CFinder 

(1.2) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 (12) 

(2.1) 

3 

3 

3 

4 

3 

3 

3 

5 

5 

5 

3 

4 

3 (7) 


(2.2) 

3 

3 

3 

4 

3 

3 

3 

5 

5 

5 

3 

4 

3 (7) 


(1.1) 

0.4 

0.55 

0.4 

0.2 

0.15 

0.85 

0.05 

0.7 

1 

0.45 

0.45 

0.4 

0.4 (3) 

SpeakEasy 

(1.2) 

0.4 

0.2 

0.4 

0.2 

0.4 

0.85 

0.45 

0.7 

1 

0.45 

0.45 

0.4 

0.4 (4) 

(2.1) 

0.15 

0.8 

0.15 

0.8 

0.15 

0.85 

0.15 

0.7 

0.7 

0.45 

0.45 

0.7 

0.15 (4) 


(2.2) 

0.4 

0.8 

0.15 

0.8 

0.4 

0.85 

0.45 

0.7 

0.7 

0.7 

0.45 

0.7 

0.7 (4) 


1.3 Email network 

This network represents email interchanges between members of the Univeristy Rovira i Virgili (Tarrag¬ 
ona) [TT]. It has 1133 nodes and 5451 edges. Table [3] shows the best value of threshold r for SLPA, the 
best value of parameter k for CFinder, and the best value of threshold tr for SpeakEasy determined by 
the twelve community quality metrics with four possible combinations of the two versions of belonging 
coefficient and two version of belonging function on email network. It can be observed from this table 
that all three algorithms demonstrate that (BC,BF)=(1,2) is the best among the four combinations of 
belonging coefficient and belonging function. 


Table 3. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on email network. The best value 
for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

QTs 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.5 

0.5 

0.5 

0.35 

0.1 

0.4 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.5 

0.4 

0.5 

0.35 

0.5 

0.5 

0.5 

0.5 

0.5 (10) 

(2,1) 

0.5 

0.5 

0.3 

0.4 

0.1 

0.4 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2,2) 

0.5 

0.5 

0.3 

0.5 

0.5 

0.4 

0.5 

0.35 

0.4 

0.5 

0.5 

0.5 

0.5 (8) 


(1,1) 

4 

3 

3 

3 

3 

3 

4 

3 

3 

3 

3 

3 

3 (10) 

CFinder 

(1,2) 

4 

3 

3 

3 

3 

3 

4 

3 

3 

3 

3 

3 

3 (10) 

(2,1) 

4 

5 

3 

4 

3 

3 

4 

10-12 

9 

10-12 

4 

8 

4(4) 


(2,2) 

4 

7 

3 

5 

3 

3 

4 

9-12 

9-12 

9-12 

4 

7 

{3,4,9-12} (3) 


(1,1) 

0.3 

0.85 

1 

0.8 

0.05 

0.75 

0.05 

0.7 

0.55 

0.9 

0.9 

0.9 

0.9 (3) 

SpeakEasy 

(1,2) 

1 

0.85 

1 

0.8 

1 

0.75 

0.85 

0.7 

0.5 

0.9 

0.9 

0.5 

1 (3) 

(2,1) 

0.3 

0.85 

1 

0.8 

0.05 

0.75 

0.05 

0.7 

0.95 

0.9 

0.9 

0.5 

{0.05,0.9} (2) 


(2,2) 

1 

0.85 

1 

0.8 

1 

0.75 

0.85 

0.7 

0.5 

0.5 

0.9 

0.5 

{0.5,1} (3) 


1.4 American College Football Network 

The network represents the schedule of games between college football teams in a single season [S] . There 
are 115 nodes and 613 edges. Table 0] shows the best value of threshold r for SLPA, the best value of 
parameter k for CFinder, and the best value of threshold tr for SpeakEasy determined by the twelve 
community quality metrics with four possible combinations of the two versions of belonging coefficient 
and two version of belonging function on American college football network. It can be observed from this 
table that SLPA performs best with (BC,BF)=(1,2), CFinder implies that (BC,BF) = (1,1) is the best, 
while SpeakEasy has no preferences. 


1.5 Jazz Musicians Network 

This is a network with 198 nodes and 2742 edges of collaborations between jazz musicians [in]. Table El 
shows the best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best 


































































Table 4. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve quality metrics with four possible combinations of the 
two versions of belonging coefficient and two version of belonging function on American college football network. 
The best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

QL 

Q7, 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1.1) 

0.4 

0.45,0.5 

0.45,0.5 

0.45,0.5 

0.05 

0.45,0.5 

0.05 

0.45,0.5 

0.45,0.5 

0.45,0.5 

0.45,0.5 

0.45,0.5 

{0.45,0.5} (9) 

SLPA 

(1,2) 

0.4 

0.45,0.5 

0.45,0.5 

0.45,0.5 

3.45,0.5 

0.45,0.5 

3.45,0.5 

0.45,0.5 

0.45,0.5 

3.45,0.5 

0.45,0.5 

3.45,0.5 

10 . 45 , 0 . 5 } ( 11 ) 

(2.1) 

0.4 

0.45,0.5 

0.3 

0.45,0.5 

0.05 

0.45,0.5 

0.05 

0.45,0.5 

0.45,0.5 

3.45,0.5 

0.45,0.5 

3.45,0.5 

{0.45,0.5} (8) 


(2.2) 

0.4 

0.45,0.5 

0.3 

0.45,0.5 

0.25 

0.45,0.5 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 (7) 


(1,1) 

4 

5 

4 

4 

3 

4 

3 

9 

4 

3 

3 

4 

TJe) 

CFinder 

(1,2) 

4 

5 

4 

4 

3 

4 

3 

9 

3 

3 

3 

4 

{3,4} (5) 

(2,1) 

4 

6 

4 

4 

3 

4 

3 

9 

9 

9 

3 

4 

475) 


(2.2) 

4 

6 

4 

4 

3 

4 

3 

9 

9 

9 

3 

4 

4 (5) 


(1,1) 

3.1,0.6 

0.75 

0.6 

0.1,0.6 

0.1,0.6 

0.75 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.6 

0.1,0.6 

0.6 {10) 

SpeakEasy 

(1,2) 

3.1,0.6 

0.75 

0.6 

0.1,0.6 

0.1,0.6 

0.75 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.6 

0.1,0.6 

0.6 (10) 

(2,1) 

3.1,0.6 

0.75 

0.6 

0.1,0.6 

0.1,0.6 

0.75 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.6 

0.1,0.6 

0.6 (10) 


(2,1) 

0.1,0.6 

0.75 

0.6 

0.1,0.6 

0.1,0.6 

0.75 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.1,0.6 

0.6 

0.1,0.6 

0.6 (10) 


value of threshold tr for SpeakEasy determined by the twelve community quality metrics with four 
possible combinations of the two versions of belonging coefficient and two version of belonging function 
on jazz musicians network. From this table, we could see that the first version of belonging coefficient 
is better than the second version when using SLPA and CFinder, while SpeakEasy demonstrates that 
(BC,BF) = (2,2) is the best among the four combinations. In summary, two of the three algorithms support 
that the hrst version of belonging coefficient is better than the second one on jazz musicians network. 


Table 5. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on jazz musicians network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.5 

0.5 

0.5 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 {10) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.5 

0.1 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 {10) 

(2,1) 

0.5 

0.5 

0.4 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(2,2) 

0.5 

0.5 

0.4 

0.5 

0.1 

0.5 

0.1 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1,1) 

14 

3 

10 

10 

3 

8 

3 

3 

3 

3 

3 

3 

3 {8) 

CFinder 

(1,2) 

10 

3 

10 

10 

3 

8 

3 

3 

3 

3 

3 

3 

3 {8) 

(2,1) 

14 

10 

8 

10 

3 

8 

3 

3 

3 

18 

3 

3 

3 (6) 


(2,2) 

10 

17 

8 

10 

3 

8 

3 

3 

19,20 

19,20 

3 

3 

3 (5) 


(1,1) 

0.75 

0.5 

0.75 

0.75 

0.1 

0.85 

0.1 

0.75 

0.75 

0.55 

0.55 

0.8 

0.75 (5) 

SpeakEasy 

(1,2) 

0.75 

0.5 

0.75 

0.75 

0.75 

0.85 

0.7 

0.75 

0.5 

0.55 

0.55 

0.5 

0.75 (5) 

(2,1) 

0.75 

0.5 

0.75 

0.75 

0.1 

0.85 

0.1 

0.75 

0.75 

0.55 

0.55 

0.8 

0.75 (5) 


(2,2) 

0.75 

0.5 

0.75 

0.75 

0.75 

0.85 

0.7 

0.75 

0.75 

0.8 

0.55 

0.8 

0.75 {6) 


1.6 Zachary’s Karate Club Network 

This network represents the friendships between 34 members of a karate club at a US university during 
two years [S3]. It has 34 nodes and 78 edges. Table [3] shows the best value of threshold r for SLPA, the 
best value of parameter k for CFinder, and the best value of threshold tr for SpeakEasy determined by 
the twelve community quality metrics with four possible combinations of the two versions of belonging 
coefficient and two version of belonging function on Zachary’s karate club network. It can be observed from 
this table that all three algorithms show that (BC,BF)=(1,2) is the best among the four combinations of 
belonging coefficient and belonging function. 


1.7 Les Miserables Network 

This is a coappearance network of characters in the novel Les Miserables m- It has 77 nodes and 254 
edges. Table |7| shows the best value of threshold r for SLPA, the best value of parameter k for CFinder, 
and the best value of threshold tr for SpeakEasy determined by the twelve community quality metrics 
with four possible combinations of the two versions of belonging coefficient and two version of belonging 
function on Les Miserables network. The table shows that for all three algorithms (BC,BF)=(1,2) is the 
best among the four combinations of belonging coefhcient and belonging function. 


































































Table 6. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve quality metrics with four possible combinations of the 
two versions of belonging coefficient and two version of belonging function on Zachary’s karate club network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1,1) 

0.5 

0.5 

0.45 

0.5 

0.1 

0.35 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.45 

0.45 

0.45 

0.45 

0.15 

0.35 

0.45 

0.45 

0.45 

0.45 

0.45 

0.45 

0.45 (10) 

(2,1) 

0.5 

0.5 

0.45 

0.5 

0.1 

0.35 

0.05 

0.5 

0.5 

0.5 

0.5 

0.45 

0.5 (7) 


(2,2) 

0.45 

0.45 

0.45 

0.45 

0.15 

0.35 

0.45 

0.45 

0.45 

0.45 

0.45 

0.45 

0.45 {10) 


(1,1) 

3 

3 

3 

3 

3 

3 

3 

5 

3 

3 

3 

3 

3 {11) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

5 

3 

3 

3 

3 

3 {11) 

(2,1) 

4 

3 

3 

3 

3 

3 

3 

5 

5 

5 

3 

3 

3 (8) 


(2,2) 

4 

4 

3 

3 

3 

3 

3 

5 

5 

5 

3 

3 

3 (7) 


(1,1) 

0.45 

0.45 

0.45 

0.45 

0.2 

0.95 

0.05 

0.7,0.75 

0.45 

0.65 

0.65 

0.65 

0.45 (5) 

SpeakEasy 

(1,2) 

0.45 

0.45 

0.45 

0.45 

0.45 

0.95 

0.65 

0.15 

0.45 

0.65 

0.65 

0.65 

0.45 {6) 

(2,1) 

0.45 

0.45 

0.15 

0.45 

0.2 

0.95 

0.05 

0.9 

0.45 

0.65 

0.65 

0.45 

0.45 (5) 


(2,2) 

0.45 

0.45 

0.15 

0.45 

0.2 

0.35 

0.65 

0.9 

0.45 

0.45 

0.85 

0.45 

0.45 {6) 


Table 7. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Les Miserables network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

^ ov 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.35 

0.5 

0.25 

0.35 

0.05 

0.5 

0.05 

0.5 

0.5 

0.25 

0.25 

0.25 

{0.25,0.5} (4) 

SLPA 

(1,2) 

0.35 

0.5 

0.25 

0.35 

0.15 

0.5 

0.25 

0.35 

0.25 

0.25 

0.25 

0.25 

0.25 {6) 

(2,1) 

0.35 

0.5 

0.15 

0.4 

0.05 

0.5 

0.05 

0.5 

0.5 

0.25 

0.25 

0.25 

0.5 (4) 


(2,2) 

0.35 

0.5 

0.15 

0.35 

0.15 

0.5 

0.15 

0.35 

0.25 

0.15 

0.15 

0.25 

0.15 (5) 


(1,1) 

6 

3 

4 

4 

3 

3 

5 

8 

3 

3 

3 

3 

3 (7) 

CFinder 

(1,2) 

5 

3 

4 

5 

3 

3 

3 

4 

3 

3 

3 

3 

3 {8) 

(2,1) 

6 

6 

4 

6 

3 

3 

5 

9 

9 

9 

3 

6 

6 (4) 


(2,2) 

6 

6 

4 

6 

3 

3 

3 

9 

9 

9 

3 

6 

{3,6} (4) 


(1,1) 

0.65 

0.85 

0.65 

0.6 

0.85 

0.85 

0.1 

0.55,0.95 

0.55 

0.85 

0.85 

0.6 

0.85 (5) 

SpeakEasy 

(1,2) 

0.65 

0.85 

0.65 

0.6 

0.85 

0.85 

0.85 

0.55,0.95 

0.55 

0.85 

0.85 

0.6 

0.85 {6) 

(2,1) 

0.65 

0.7 

0.65,0.8 

0.6 

0.85 

0.85 

0.1 

0.95 

0.55 

0.85 

0.85 

0.6 

0.85 (4) 


(2,2) 

0.65,0.8 

0.7 

0.65,0.8 

0.45,0.6 

0.85 

0.85 

0.85 

0.45,0.95 

0.55 

0.95 

0.85 

0.45,0.6 

0.85 (4) 


Table 8. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve quality metrics with four possible combinations of the 
two versions of belonging coefficient and two version of belonging function Network Science coauthorship network. 
The best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.05 

0.4 

0.5 

0.35 

0.05 

0.4 

0.05 

0.5 

0.5 

0.35 

0.35 

0.35 

0.35 (4) 

SLPA 

(1,2) 

0.5 

0.4 

0.5 

0.4 

0.5 

0.4 

0.25 

0.5 

0.35 

0.2 

0.15 

0.35 

0.5 (4) 

(2,1) 

0.05 

0.4 

0.35 

0.35 

0.05 

0.4 

0.05 

0.5 

0.5 

0.35 

0.35 

0.35 

0.35 (5) 


(2,2) 

0.5 

0.4 

0.35 

0.35 

0.15 

0.4 

0.25 

0.5 

0.5 

0.2 

0.15 

0.35 

{0.35,0.5} (3) 


(1,1) 

3 

3 

3 

3 

3 

3 

3 

7 

3 

3 

3 

3 

3 (11) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 {12) 

(2,1) 

3 

3 

3 

3 

3 

3 

3 

11-20 

11-20 

11-20 

3 

3 

3 (9) 


(2,2) 

3 

3 

3 

3 

3 

3 

3 

9-20 

9-20 

9-20 

3 

3 

3 (9) 


(1,1) 

0.05 

0.25 

0.25 

0.25 

0.05 

0.7 

0.05 

1 

1 

0.7 

0.7 

0.7 

0.7 (4) 

SpeakEasy 

(1,2) 

0.25 

0.25 

0.25 

0.25 

0.25 

0.7 

0.7 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 {10) 

(2,1) 

0.05 

0.3 

0.15 

0.25 

0.05 

0.7 

0.05 

0.85 

1 

0.25 

0.7 

1 

0.05 (3) 


(2,2) 

0.15 

0.3 

0.15 

0.25 

0.25 

0.7 

0.7 

0.4 

0.2 

0.2 

0.9 

1 

{0.15,0.2,0.25,0.7} (2) 


































































































1.8 Network Science Coauthorship Network 

This is a coauthorship network of scientists working on network theory and experiment m- There are 
1461 nodes and 2742 edges. Table [5] shows the best value of threshold r for SLPA, the best value of 
parameter k for CFinder, and the best value of threshold tr for SpeakEasy determined by the twelve 
community quality metrics with four possible combinations of the two versions of belonging coefficient 
and two version of belonging function on Network Science coauthorship network. We can observe from 
this table that SLPA shows that (BC,BF)=(2,1) is the best among the four combinations, while CFinder 
and SpeakEasy indicate that (BC,BF)=(1,2) is the best. Thus, we could conclude that (BC,BF)=(1,2) 
is the best on Network Science coauthorship network. 


1.9 PGP Network 

This is the largest connected component of the network of users of the Pretty-Good-Privacy (PGP) 
algorithm for secure information interchange [5]. It has 10680 nodes and 24316 edges in total. Table IHl 
shows the best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible 
combinations of the two versions of belonging coefficient and two version of belonging function on PGP 
Network. It can be seen from the table that all three algorithms show that (BG,BF)=(1,2) is the best 
among the four combinations of belonging coefficient and belonging function. 


Table 9. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on PGP network. The best value 
for subcolunm of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

Qov 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.05 

0.5 

0.5 

0.5 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.05 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (10) 

(2,1) 

0.05 

0.5 

0.2 

0.5 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2,2) 

0.45 

0.5 

0.2 

0.5 

0.05 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1,1) 

4 

3 

3 

3 

3 

3 

3 

13 

3 

3 

3 

3 

3 (10) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 (12) 

(2,1) 

4 

3 

3 

6 

3 

3 

3 

18 

18 

19 

3 

13 

3 (6) 


(2,2) 

3 

3 

3 

6 

3 

3 

3 

18 

18 

14,15 

3 

3 

3 (8) 


(1,1) 

0.05 

0.25 

0.85 

0.9 

0.05 

0.7 

0.05 

0.75 

0.85 

0.85 

0.85 

0.85 

0.85 (5) 

SpeakEasy 

(1,2) 

0.85 

0.25 

0.85 

0.9 

0.85 

0.7 

0.85 

0.75 

0.85 

0.85 

0.25 

0.85 

0.85 (7) 

(2,1) 

0.05 

0.8 

0.85 

0.9 

0.05 

0.7 

0.05 

0.75 

0.75 

0.85 

0.85 

0.75 

{0.05,0.75,0.85} (3) 


(2,2) 

0.85 

0.8 

0.85 

0.9 

0.85 

0.7 

0.85 

0.6 

0.75 

0.75 

0.2 

0.75 

0.85 (4) 


Table 10. The best value of threshold r for SLPA and the best value of threshold tr for SpeakEasy determined by 
the twelve community quality metrics with four possible combinations of the two versions of belonging coefficient 
and two version of belonging function on political blogs network. The best value for subcolumn of the last column 
is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1.1) 

0.5 

0.5 

0.5 

0.4 

0.5 

0.35 

0.3 

0.4 

0.45 

0.5 

0.5 

0.5 

0.5 (7) 

SLPA 

(1.2) 

0.5 

0.5 

0.5 

0.4 

0.5 

0.4 

0.5 

0.15 

0.4 

0.5 

0.5 

0.5 

0.5 (8) 

(2.1) 

0.5 

0.5 

0.05 

0.4 

0.5 

0.4 

0.3 

0.4 

0.45 

0.5 

0.5 

0.5 

0.5 (6) 


(2.2) 

0.25 

0.25 

0.05 

0.25 

0.5 

0.4 

0.5 

0.05 

0.35 

0.5 

0.5 

0.3 

0.5 (4) 


(1.1) 

0.85 

0.45 

0.85 

0.7 

0.9 

0.35 

0.05 

0.7 

0.7 

0.25 

0.45 

0.7 

0.7 (4) 

SpeakEasy 

(1.2) 

0.85 

0.45 

0.85 

0.7 

0.9 

0.35 

0.9 

0.7 

0.7 

0.25 

0.45 

0.7 

0.7 (4) 

(2,1) 

0.85 

0.45 

0.45 

0.7 

0.9 

0.35 

0.05 

0.7 

0.5 

0.25 

0.45 

0.5 

0.45 (3) 


(2.2) 

0.8 

0.8 

0.45 

0.7 

0.9 

0.35 

0.9 

0.7 

0.5 

0.4 

0.6 

0.5 

(0.5,0.7,0.8,0.91 (2) 


1.10 Political Blogs Network 

This is a directed network of hyperlinks between weblogs on US politics, recorded in 2005 by Adamic 
and Glance [I]. There are 1224 nodes and 19022 edges. Table fTOl shows the best value of threshold r for 
SLPA and the best value of threshold tr for SpeakEasy determined by the twelve community quality 
metrics with four possible combinations of the two versions of belonging coefficient and two version 






























































of belonging function on political blogs network. Results for CFinder are not provided because it has 
not finished running on this network for more than two months processing many potential fc-cliques 
resulting from dense connections. It can be seen from the table that both SLPA and SpeakEasy imply 
that (BC,BF)=(1,2) is the best among the four combinations of belonging coefficient and belonging 
function. 


1.11 Political Books Network 

This is a network of books about US politics published around the time of the 2004 presidential election 
and sold by the online bookseller Amazon.com M- It has 105 nodes and 441 edges in total. Edges 
between books indicate frequent copurchasing of books by the same buyers. Table [TT] shows the best 
value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of threshold 
tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations of 
the two versions of belonging coefficient and two version of belonging function on political books network. 
We could learn from the table that SLPA shows that the first version of belonging function is better than 
the second version, CFinder implies that the first version of belonging coefficient is better than the second 
one, and SpeakEasy indicates that (BC,BF)=(1,2) is the best among all four combinations. In summary, 
there are two out of three algorithms support that (BC,BF) = (1,2) is the best on political books network. 


Table 11. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging fnnction on political books network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

qL 

QTs 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1.1) 

0.5 

0.5 

0.2 

0.4 

0.05 

0.4 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 

SLPA 

(1.2) 

0.5 

0.5 

0.2 

0.4 

0.1 

0.4 

0.1 

0.4 

0.2 

0.2 

0.2 

0.2 

0.2 (5) 

(2.1) 

0.5 

0.5 

0.1 

0.4 

0.05 

0.4 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.5 

0.5 

0.1 

0.4 

0.1 

0.4 

0.1 

0.4 

0.2 

0.2 

0.2 

0.2 

0.2 (4) 


(1.1) 

4 

3 

3 

3 

3 

3 

3 

6 

3 

3 

3 

3 

3 {10) 

CFinder 

(1.2) 

3 

3 

3 

4 

3 

3 

3 

6 

3 

3 

3 

3 

3 {10) 

(2.1) 

4 

4 

3 

4 

3 

3 

3 

6 

6 

3 

3 

3 

3 (7) 


(2.2) 

4 

4 

3 

4 

3 

3 

3 

6 

6 

6 

3 

3 

3 (6) 


(1.1) 

0.95 

1 

0.5 

0.95 

0.25 

0.95 

0.05 

0.5 

0.9 

0.95 

0.95 

0.9 

0.95 (5) 

SpeakEasy 

(1.2) 

0.95 

0.55 

0.5 

0.95 

0.95 

0.95 

0.9 

0.5 

0.9 

0.95 

0.95 

0.9 

0.95 {6) 

(2.1) 

0.5 

1 

0.5 

0.95 

0.25 

0.95 

0.05 

0.5 

0.85 

0.9 

0.95 

0.9 

{0.5,0.95} (3) 


(2.2) 

0.5 

0.55 

0.5 

0.95 

0.95 

0.95 

0.9 

0.5 

0.85 

0.85 

0.95 

0.9 

0.95 (4) 


Table 12. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Indian railway network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

QTs 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.15 

0.5 

0.5 

0.5 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.5 

0.35 

0.5 

0.5 

0.05 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 {9) 

(2,1) 

0.05 

0.5 

0.15 

0.5 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2,2) 

0.5 

0.35 

0.15 

0.5 

0.05 

0.45 

0.5 

0.35 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(1,1) 

6 

3 

4 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 {9) 

CFinder 

(1,2) 

6 

3 

4 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 {9) 

(2,1) 

6 

6 

4 

5 

3 

3 

3 

10 

10 

10 

3 

4 

3 (4) 


(2,2) 

6 

6 

4 

6 

3 

3 

3 

10 

10 

10 

3 

4 

3 (4) 


(1,1) 

0.55 

0.8 

0.45 

1 

0.05 

0.7 

0.05 

0.95 

0.95 

0.8 

0.8 

0.8 

0.8 (4) 

SpeakEasy 

(1,2) 

0.45 

0.7 

0.45 

1 

0.8 

0.7 

0.8 

0.95 

0.55 

0.8 

0.8 

0.8 

0.8 (5) 

(2,1) 

0.55 

0.8 

0.45 

1 

0.05 

0.7 

0.05 

0.95 

0.95 

0.8 

0.8 

0.55 

0.8 (3) 


(2,2) 

0.45 

0.8 

0.45 

1 

0.8 

0.7 

0.8 

0.95 

0.55 

0.55 

0.8 

0.55 

0.8 (4) 


1.12 Indian Railway Network 

This network consists of nodes representing Indian railway stations, where two stations are connected by 
an edge if there exists at least one train-route such that both stations are scheduled stops on that route 


































































[3]. There are 297 nodes and 1213 edges. Table [T^] shows the best value of threshold r for SLPA, the 
best value of parameter k for CFinder, and the best value of threshold tr for SpeakEasy determined by 
the twelve community quality metrics with four possible combinations of the two versions of belonging 
coefficient and two version of belonging function on Indian railway network. It can be seen from the table 
that all three algorithms show that (BC,BF)=(1,2) is the best among the four combinations of belonging 
coefficient and belonging function. 


1.13 Santa Fe Institute Collaboration Network 

This is the largest connected component of the collaboration network of scientists at the Santa Fe Institute, 
an interdisciplinary research center in Santa Fe, New Mexico [S]. It has 118 nodes and 200 edges. Nodes 
in this network represent scientists in residence at the Santa Fe Institute during any part of calendar year 
1999 or 2000 and their collaborators. An edge is drawn between a pair of scientists if they coauthored 
one or more articles during the same time period. The network includes all journal and book publications 
by the scientists involved, along with all papers that appeared in the institutes technical reports series. 
Table [T3] shows the best value of threshold r for SLPA, the best value of parameter k for CFinder, 
and the best value of threshold tr for SpeakEasy determined by the twelve community quality metrics 
with four possible combinations of the two versions of belonging coefficient and two version of belonging 
function on Santa Ee Institute collaboration network. We could learn that all three algorithms show that 
(BC,BF)=(1,2) is the best among the four combinations of belonging coefficient and belonging function. 


Table 13. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve quality metrics with four combinations of the two versions 
of belonging coefhcient and two version of belonging function on Santa Fe Institute collaboration network. The 
best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1.1) 

0.4 

0.5 

0.4 

0.5 

0.05 

0.5 

0.05 

0.5 

0.4 

0.4 

0.4 

0.4 

0.4 (6) 

SLPA 

(1,2) 

0.4 

0.5 

0.4 

0.5 

0.1 

0.5 

0.4 

0.5 

0.4 

0.4 

0.4 

0.4 

0.4 (7) 

(2,1) 

0.4 

0.5 

0.1 

0.5 

0.05 

0.5 

0.05 

0.5 

0.4 

0.4 

0.4 

0.4 

0.4 (5) 


(2,2) 

0.4 

0.5 

0.1 

0.5 

0.1 

0.5 

0.1 

0.5 

0.4 

0.4 

0.4 

0.4 

0.4 (5) 


(1.1) 

3 

3 

3 

3 

3 

3 

3 

5 

4 

3 

3 

3 

3 (10) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

4 

3 

3 

3 

3 

3 (11) 

(2,1) 

3 

3 

3 

4 

3 

3 

3 

5 

5 

5 

3 

5 

3 (7) 


(2,2) 

3 

3 

3 

3 

3 

3 

3 

5 

5 

5 

3 

4 

3 (8) 


(1,1) 

0.1 

0.9 

0.65 

0.65 

0.05 

0.9 

0.05 

1 

0.95 

0.9 

0.9 

0.9 

0.9 (5) 

SpeakEasy 

(1.2) 

0.55 

0.9 

0.65 

0.35 

0.35 

0.9 

0.9 

1 

0.95 

0.9 

0.9 

0.35 

0.9 (5) 

(2,1) 

0.1 

0.9 

0.1 

0.65 

0.05 

0.9 

0.05 

1 

0.95 

0.9 

0.9 

0.65 

0.9 (4) 


(2,2) 

0.1 

0.9 

0.1 

0.65 

0.35 

0.9 

0.9 

0.65 

0.65 

1 

0.9 

0.65 

{0.65,0.9} (4) 


1.14 Protein-protein Interaction Networks 

We consider eight protein-protein interaction networks in the experiments. Collins_cyc, Collins_cyc_w, 
Collinsjnips, Collins_sgd, Gavin_cyc, Gavin_cyc_w, Gavin_mips, and Gavin_sgd are two kinds 
(referred as Collins [1] and Gavin [5] here) of popular high throughput protein-protein interaction net¬ 
works derived from measurements obtained by affinity purification and mass spectrometry (AP-MS) 
techniques [7]. These two kinds of networks are further refined with three gold-standards for protein 
complexes, including the classic Munich Information Center for Protein Sequences (MIPS) [17] and the 
more recent Saccharomyces Genome Database (SGD) [H]. The complete MIPS dataset as well as par¬ 
tial information from SGD are incorporated into a third protein complex list known as CYG2008 |3D] . 
Thus, we have Collins_cyc, Collinsjnips, Collins_sgd, Gavin_cyc, Gavin_mips, and Gavin_sgd, 
respectively. Collins_cyc_w and Gavin_cyc_w are respectively the weighted versions of Collins_cyc 
and Gavin_cyc, in which the weight is proportional to the probability a given interaction pair truly 
exists. 

Tables [T31I3T] show the best value of threshold r for SLPA, the best value of parameter k for CFinder, 
and the best value of threshold tr for SpeakEasy determined by the twelve community quality metrics 
with four possible combinations of the two versions of belonging coefficient and two version of belonging 
function on the eight protein-protein interaction networks. Results for CFinder on Collins_cyc_w and 
Gavin_cyc_w are not provided because it has not finished running on Collins_cyc_w and Gavin_cyc_w 



































Table 14. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Collins_cyc. The best value 
for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

^OV 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1,1) 

0.05 

0.5 

0.5 

0.4 

0.1 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.5 

0.45 

0.5 

0.5 

0.2 

0.5 

0.2 

0.5 

0.4 

0.2 

0.2 

0.4 

0.5 (5) 

(2,1) 

0.05 

0.5 

0.5 

0.4 

0.1 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(2,2) 

0.5 

0.5 

0.5 

0.5 

0.2 

0.5 

0.2 

0.5 

0.4 

0.2 

0.2 

0.4 

0.5 (6) 


(1,1) 

3 

3 

3 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 {11} 

CFinder 

(1,2) 

3 

3 

3 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 (11) 

(2,1) 

3 

9 

3 

6 

3 

3 

3 

18 

18 

20 

3 

3 

3 (7) 


(2,2) 

3 

3 

3 

6 

3 

3 

3 

17-20 

17-20 

17-20 

3 

3 

3 (8) 


(1,1) 

0.05 

0.9 

0.9 

1 

0.9 

0.9 

0.05 

0.8 

0.9 

0.9 

0.9 

0.9 

0.9 (8) 

SpeakEasy 

(1,2) 

0.9 

0.9 

0.9 

1 

0.9 

0.9 

0.9 

0.8 

0.9 

0.9 

0.9 

0.9 

0.9 (10) 

(2,1) 

0.05 

0.55 

0.9 

0.95 

0.05 

0.9 

0.05 

0.8 

0.95 

0.9 

0.9 

0.95 

0.9 (4) 


(2,2) 

0.9 

0.7 

0.9 

0.95 

0.9 

0.9 

0.9 

0.8 

0.25 

0.25 

0.9 

0.95 

0.9 (6) 


Table 15. The best value of threshold r for SLPA and the best value of threshold tr for SpeakEasy determined by 
the twelve community quality metrics with four possible combinations of the two versions of belonging coefficient 
and two version of belonging function on Collins_cyc_w. The best value for subcolumn of the last column is 
marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

Qov 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

P 

D 



(1,1) 

0.05 

0.35 

0.05 

0.35 

0.05 

0.25 

0.05 

0.5 

0.05 

0.05 

0.05 

0.2 

0.05 (7) 

SLPA 

(1,2) 

0.05 

0.35 

0.05 

0.35 

0.05 

0.25 

0.4 

0.05 

0.05 

0.05 

0.05 

0.35 

0.05 (7) 

(2,1) 

0.05 

0.35 

0.05 

0.35 

0.05 

0.25 

0.05 

0.5 

0.5 

0.05 

0.05 

0.2 

0.05 (6) 


(2,2) 

0.05 

0.5 

0.05 

0.35 

0.05 

0.25 

0.05 

0.05 

0.05 

0.05 

0.05 

0.2 

0.05 (8) 


(1,1) 

0.1 

0.55 

0.55 

0.9 

0.1 

0.65 

0.1 

0.95 

0.55 

0.55 

0.55 

0.55 

0.55 (6) 

SpeakEasy 

(1,2) 

0.55 

0.55 

0.55 

0.9 

0.55 

0.65 

0.55 

0.95 

0.55 

0.55 

0.55 

0.55 

0.55 (9) 

(2,1) 

0.1 

0.4 

0.55 

0.9 

0.1 

0.65 

0.1 

0.95 

0.8 

0.55 

0.55 

0.45 

{0.1,0.55} (3) 


(2,2) 

0.55 

0.4 

0.55 

0.9 

0.55 

0.65 

0.55 

0.8 

0.8 

0.8 

0.7 

0.45 

0.55 (4) 


Table 16. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Collinsjnips. The best 
value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

^OV 

Q7 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1,1) 

0.15 

0.45 

0.45 

0.45 

0.05 

0.5 

0.05 

0.5 

0.45 

0.15 

0.05 

0.45 

0.45 (5) 

SLPA 

(1,2) 

0.45 

0.45 

0.45 

0.45 

0.05 

0.5 

0.15 

0.5 

0.45 

0.05 

0.05 

0.45 

0.45 (6) 

(2,1) 

0.15 

0.45 

0.05 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.45 

0.05 

0.45 

{0.05,0.45} 4 


(2,2) 

0.45 

0.5 

0.05 

0.45 

0.05 

0.5 

0.15 

0.5 

0.45 

0.05 

0.05 

0.45 

{0.05,0.45} (4) 


(1,1) 

4 

3 

3 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 (10) 

CFinder 

(1,2) 

4 

3 

3 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 (10) 

(2.1) 

4 

7 

3 

7 

3 

3 

3 

19 

19 

20 

3 

3 

3 ( 6 ) 


(2.2) 

4 

7 

3 

7 

3 

3 

3 

16-20 

16-20 

16-20 

3 

3 

3 ( 6 ) 


(1.1) 

0.05 

0.4 

0.4 

1 

0.4 

0.4 

0.35 

0.6 

0.7 

0.4 

0.4 

0.4 

0.4 (7) 

SpeakEasy 

(1,2) 

0.4 

0.4 

0.4 

1 

0.4 

0.4 

0.4 

0.6 

0.4 

0.4 

0.4 

0.4 

0.4 ( 10 ) 

(2,1) 

0.05 

0.4 

0.25 

1 

0.4 

0.4 

0.35 

0.6 

0.75 

0.4 

0.4 

0.75 

0.4 (5) 


(2,2) 

0.25 

0.95 

0.25 

1 

0.4 

0.4 

0.4 

0.6 

0.6 

0.6 

0.25 

0.75 

{0.25,0.4,0.6} (3) 






























































































Table 17. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Collins_sgd. The best value 
for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1,1) 

0.05 

0.5 

0.35 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.35 

0.35 

0.45 

0.5 (4) 

SLPA 

(1,2) 

0.35 

0.45 

0.35 

0.45 

0.1 

0.5 

0.1 

0.5 

0.35 

0.1 

0.1 

0.45 

0.1 (4) 

(2,1) 

0.05 

0.5 

0.1 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.35 

0.35 

0.45 

0.5 (4) 


(2,2) 

0.35 

0.5 

0.1 

0.45 

0.1 

0.3 

0.1 

0.5 

0.1 

0.1 

0.1 

0.4 

0.1 {6} 


(1,1) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 (12) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 (12) 

(2,1) 

3 

3 

3 

6 

3 

3 

3 

16-20 

16-20 

16-20 

3 

3 

3 (8) 


(2,2) 

3 

3 

3 

6 

3 

3 

3 

16-20 

16-20 

16-20 

3 

3 

3 (8) 


(1,1) 

0.1 

0.8 

0.8 

0.75 

0.05 

0.8 

0.05 

0.5 

0.8 

0.8 

0.8 

0.8 

0.8 (7) 

SpeakEasy 

(1,2) 

0.8 

0.8 

0.8 

0.75 

0.8 

0.8 

0.8 

0.35 

0.8 

0.8 

0.8 

0.8 

0.8 (10) 

(2,1) 

0.05 

0.8 

0.35 

0.5 

0.05 

0.8 

0.15 

0.5 

0.5 

0.8 

0.8 

0.5 

{0.5,0.81 (4) 


(2,2) 

0.35 

0.8 

0.35 

0.5 

0.8 

0.8 

0.8 

0.5 

0.5 

0.5 

0.8 

0.5 

{0.5,0.8} (5) 


Table 18. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Gavin_cyc. The best value 
for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

qL 

Q7. 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1.1) 

0.05 

0.45 

0.5 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.45 

0.45 

0.45 

0.45 (5) 

SLPA 

(1,2) 

0.5 

0.45 

0.5 

0.45 

0.15 

0.5 

0.2 

0.5 

0.45 

0.45 

0.45 

0.45 

0.45 (6) 

(2,1) 

0.05 

0.45 

0.2 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.45 

0.45 

0.45 

0.45 (5) 


(2,2) 

0.45 

0.45 

0.2 

0.45 

0.15 

0.5 

0.2 

0.5 

0.45 

0.45 

0.45 

0.45 

0.45 (7) 


(1,1) 

3 

3 

3 

3 

3 

3 

3 

4 

3 

3 

3 

3 

3 (11) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 (12) 

(2,1) 

3 

4 

3 

5 

3 

3 

3 

13 

13 

18-20 

3 

4 

3 (6) 


(2,2) 

3 

4 

3 

5 

3 

3 

3 

11-20 

11-20 

11-20 

3 

4 

3 (6) 


(1,1) 

0.25 

0.95 

0.7 

0.85 

0.05 

1 

0.05 

0.7 

0.7 

0.7 

0.7 

0.7 

0.7 (6) 

SpeakEasy 

(1,2) 

0.7 

0.95 

0.7 

0.85 

0.7 

1 

1 

0.7 

0.7 

0.7 

0.7 

0.7 

0.7 (8) 

(2,1) 

0.1 

0.95 

0.7 

0.85 

0.05 

1 

0.05 

1 

0.7 

0.7 

0.7 

0.7 

0.7 (5) 


(2,2) 

0.7 

0.95 

0.7 

0.85 

0.7 

1 

1 

0.7 

0.7 

0.85 

0.7 

0.85 

0.7 (6) 


Table 19. The best value of r for SLPA and the best value of tr for SpeakEasy determined by the twelve com¬ 
munity quality metrics with four combinations of belonging coefficient and belonging function on Gavin_cyc_w. 
The best value for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

Qov 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.05 

0.4 

0.35 

0.3 

0.05 

0.35 

0.05 

0.5 

0.45 

0.35 

0.35 

0.35 

0.35 (5) 

SLPA 

(1,2) 

0.35 

0.4 

0.35 

0.3 

0.05 

0.35 

0.3 

0.45 

0.3 

0.3 

0.3 

0.3 

0.3 (6) 

(2,1) 

0.05 

0.45 

0.05 

0.3 

0.05 

0.35 

0.05 

0.5 

0.5 

0.35 

0.35 

0.45 

0.05 (4) 


(2,2) 

0.3 

0.45 

0.05 

0.3 

0.05 

0.35 

0.3 

0.45 

0.5 

0.5 

0.3 

0.3 

0.3 (5) 


(1,1) 

0.05 

0.4 

0.45 

0.65 

0.05 

0.65 

0.1 

0.95 

0.7 

0.7 

0.7 

0.7 

0.7 (4) 

SpeakEasy 

(1,2) 

0.45 

0.4 

0.45 

0.65 

0.7 

0.65 

0.7 

0.95 

0.45 

0.7 

0.7 

0.7 

0.7 (5) 

(2,1) 

0.05 

0.7 

0.45 

0.5 

0.05 

0.65 

0.1 

0.95 

0.95 

0.5 

0.7 

1 

{0.05,0.5,0.7,0.95} (2) 


(2,2) 

0.45 

0.7 

0.45 

0.65 

0.7 

0.65 

0.7 

0.95 

0.95 

0.9 

0.4 

1 

0.7 (3) 






























































































for more than two months processing many potential fc-cliques associated with intensity larger than the 
intensity threshold [B]. 

From Table M we could see that on Collins_cyc SLPA implies that the first version of belonging 
function is better than the second version, CFinder indicates that the first version of belonging coefficient 
is better than the second one, and SpeakEasy demonstrates that (BC,BF)=(1,2) is the best among all four 
combinations. In summary, there are two out of three algorithms show that (BC,BF)=(1,2) is the best 
on Collins_cyc. Also, SLPA in Table [T51 implies that (BC,BF)=(2,2) is the best, while SpeakEasy shows 
that (BC,BF)=(1,2) is the best on Collins_cyc_w. It can be seen from Table [TBl that all three algorithms 
support the conclusion that (BC,BF)=(1,2) is the best among the four combinations on Collins_mips. In 
addition on Collins_sgd iTable flTll . SLPA shows that (BC,BF)=(2,2) is the best, CFinder implies that 
the first version of belonging coefficient is better than the second version, and SpeakEasy indicates that 
(BC,BF)=(1,2) is the best among the four combinations of belonging coefficient and belonging function. 
Thus, two out of three algorithms conclude that (BC,BF)=(1,2) is the best among the four combinations 
on Collins_sgd. 

It can be observed from Table [TH] that on Ganvin_cyc SLPA shows that (BC,BF) = (2,2) is the 
best, while CFinder and SpeakEasy indicate that (BC,BF)=(1,2) is the best. Hence, two out of three 
algorithms support that (BC,BF)=(I,2) is the best among the four combinations on Ganvin_cyc. From 
Tables [T9][2TJ we could learn that all three algorithms show that (BC,BF) = (1,2) is the best among the 
four combinations on Ganvin_cyc_w, Ganvin_mips, and Ganvin_sgd. 

From the analysis above, we could conclude that (BC,BF) = (1,2) is the best among the four possible 
combinations of the two versions of belonging coefficient and two versions of belonging function on 
protein-protein interaction networks. 


Table 20. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Gavin_mips. The best value 
for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1.1) 

0.05 

0.5 

0.5 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 {11) 

(2,1) 

0.05 

0.5 

0.25 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(2,2) 

0.5 

0.45 

0.25 

0.5 

0.1 

0.4 

0.3 

0.5 

0.35 

0.5 

0.5 

0.5 

0.5 (6) 


(1,1) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 {12) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 {12) 

(2,1) 

3 

3 

3 

4 

3 

3 

3 

12 

12 

18-20 

3 

4 

3 (7) 


(2,2) 

3 

4 

3 

4 

3 

3 

3 

11-20 

11-20 

11-20 

3 

4 

3 (6) 


(1.1) 

0.05 

0.4 

0.9 

0.7 

0.05 

0.95 

0.1 

0.9 

0.9 

0.9 

0.9 

0.9 

0.9 (6) 

SpeakEasy 

(1.2) 

0.9 

0.4 

0.9 

0.7 

0.9 

0.95 

0.9 

0.4 

0.9 

0.9 

0.9 

0.9 

0.9 {8) 

(2,1) 

0.05 

0.95 

0.9 

0.7 

0.05 

0.95 

0.1 

0.9 

0.9 

0.9 

0.9 

0.4 

0.9 (5) 


(2,2) 

0.9 

0.95 

0.9 

0.7 

0.9 

0.35 

0.9 

0.4 

0.4 

0.4 

0.9 

0.4 

0.9 (5) 


Table 21. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value of 
threshold tr for SpeakEasy determined by the twelve community quality metrics with four possible combinations 
of the two versions of belonging coefficient and two version of belonging function on Gavin_sgd. The best value 
for subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

^OV 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.05 

0.5 

0.5 

0.5 

0.1 

0.35 

0.1 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.35 

0.5 

0.35 

0.5 

0.5 

0.5 

0.5 

0.5 {9) 

(2,1) 

0.05 

0.5 

0.15 

0.5 

0.1 

0.35 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2,2) 

0.5 

0.45 

0.15 

0.5 

0.1 

0.35 

0.5 

0.35 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(1,1) 

3 

3 

3 

4 

3 

3 

3 

3 

3 

3 

3 

3 

3 {11) 

CFinder 

(1,2) 

3 

3 

3 

4 

3 

3 

3 

3 

3 

3 

3 

3 

3 {11) 

(2,1) 

3 

4 

3 

5 

3 

3 

3 

13 

13 

13 

3 

4 

3 (6) 


(2,2) 

3 

4 

3 

5 

3 

3 

3 

11-17 

11-17 

11-17 

3 

4 

3 (6) 


(1,1) 

0.05 

0.8 

0.45 

0.5 

0.8 

0.8 

0.15 

0.7 

0.45 

0.8 

0.8 

0.45 

0.8 (5) 

SpeakEasy 

(1,2) 

0.45 

0.8 

0.45 

0.5 

0.8 

0.8 

0.8 

0.7 

0.45 

0.8 

0.8 

0.45 

0.8 {6) 

(2,1) 

0.05 

0.8 

0.8 

0.5 

0.8 

0.8 

0.15 

0.7 

0.45 

0.8 

0.8 

0.45 

0.8 {6) 


(2,2) 

0.45 

0.8 

0.8 

0.5 

0.8 

0.8 

0.8 

0.7 

0.45 

0.45 

0.8 

0.45 

0.8 {6) 




































































1.15 Amazon Product Network 


This is a product co-purchased network of the Amazon website [52]. If a product pi is frequently co¬ 
purchased with product pj, the graph contains an undirected edge from pi to pj. There are 319948 nodes 
and 880215 edges. Table 1551 shows the best value of threshold r for SLPA, the best value of parameter 
k for CFinder, and the best value of threshold tr for SpeakEasy determined by the twelve community 
quality metrics with four possible combinations of the two versions of belonging coefhcient and two 
version of belonging function on Amazon product network. We can see that all three algorithms show 
that (BC,BF)=(1,2) is the best among the four combinations of belonging coefficient and belonging 
function. 


Table 22. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve quality metrics with four combinations of the two versions 
of belonging coefficient and two version of belonging function on Amazon product network. The best value for 
subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Q OV 

NQov 

qL 

QT. 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.05 

0.5 

0.5 

0.5 

0.1 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.5 

0.4 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (10) 

(2,1) 

0.05 

0.5 

0.3 

0.45 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (6) 


(2,2) 

0.5 

0.5 

0.3 

0.5 

0.5 

0.4 

0.5 

0.4 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1,1) 

3 

3 

3 

3 

3 

3 

3 

7 

3 

3 

3 

3 

3 (11) 

CFinder 

(1,2) 

3 

3 

3 

3 

3 

3 

3 

5 

3 

3 

3 

3 

3 (11) 

(2,1) 

3 

4 

3 

4 

3 

3 

3 

7 

7 

7 

3 

5 

3 (6) 


(2,2) 

3 

4 

3 

4 

3 

3 

3 

7 

7 

7 

3 

4 

3 (6) 


(1,1) 

0.1 

1 

1 

1 

0.1 

1 

0.1 

1 

1 

1 

1 

1 

1 (9) 

SpeakEasy 

(1,2) 

1 

1 

1 

1 

1 

1 

1 

1 

1 

0.95 

1 

1 

1 (11) 

(2,1) 

0.1 

1 

0.95 

1 

0.1 

1 

0.1 

1 

1 

1 

1 

1 

1 (8) 


(2,2) 

1 

1 

0.95 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 (11) 


1.16 DBLP Collaboration Network 

The DBLP computer science bibliography provides a comprehensive list of research papers in computer 
science. In this DBLP co-authorship network, two authors are connected if they publish at least one paper 
together [22] . It has 260998 nodes and 950059 edges in total. Table |23l shows the best value of threshold 
r for SLPA, the best value of parameter k for CFinder, and the best value of threshold tr for SpeakEasy 
determined by the twelve community quality metrics with four possible combinations of the two versions 
of belonging coefficient and two version of belonging function on DBLP collaboration network. We can see 
that all three algorithms show that (BC,BF) = (1,2) is the best among the four combinations of belonging 
coefficient and belonging function. 


Table 23. The best value of threshold r for SLPA, the best value of parameter k for CFinder, and the best value 
of threshold tr for SpeakEasy determined by the twelve quality metrics with four combinations of the two versions 
of belonging coefficient and two version of belonging function on DBLP collaboration network. The best value for 
subcolumn of the last column is marked by red italic font. 


Algorithm 

(BC,BF) 

Qov 

NQov 

^ OV 

QZ 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1,1) 

0.05 

0.5 

0.5 

0.45 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 

SLPA 

(1,2) 

0.5 

0.5 

0.5 

0.5 

0.5 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (10) 

(2,1) 

0.05 

0.5 

0.3 

0.45 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (6) 


(2,2) 

0.5 

0.5 

0.3 

0.5 

0.15 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1,1) 

4 

3 

3 

3 

3 

3 

3 

18 

7 

3 

3 

3 

3 (9) 

CFinder 

(1,2) 

4 

3 

3 

3 

3 

3 

3 

9 

3 

3 

3 

3 

3 (10) 

(2,1) 

4 

5 

3 

6 

3 

3 

3 

20 

19 

20 

3 

16 

3 (5) 


(2,2) 

4 

5 

3 

5 

3 

3 

3 

20 

19 

20 

3 

9 

3 (5) 


(1,1) 

0.1 

1 

1 

1 

0.1 

1 

0.1 

1 

1 

1 

1 

1 

1 (9) 

SpeakEasy 

(1,2) 

1 

0.95 

1 

1 

1 

1 

0.95 

1 

1 

1 

1 

1 

1 (10) 

(2,1) 

0.1 

1 

0.95 

1 

0.1 

1 

0.1 

1 

1 

1 

1 

1 

1 (8) 


(2,2) 

1 

1 

0.95 

1 

1 

1 

0.95 

1 

1 

1 

1 

1 

1 (10) 


































































2 LFR Benchmark Networks 


LFR (named after the initials of names of authors) benchmark networks [15| have become a standard 
in the evaluation of the performance of community detection algorithms. The LFR benchmark network 
that we used here has 1000 nodes with average degree 15 and maximum degree 50. The exponent 7 for 
the degree sequence varies from 2 to 3. The exponent P for the community size distribution ranges from 
1 to 2. Then, four pairs of the exponents (7,/3) = (2,1), (2,2), (3,1), and (3,2) are chosen in order to 
explore the widest spectrum of graph structures. The mixing parameter fi is varied from 0.05 to 0.95. 
It means that each node shares a fraction (1 — ^) of its edges with the other nodes in its community 
and shares a fraction fi of its edges with the nodes outside its community. Thus, low mixing parameters 
indicate strong community structure. The degree of overlap is determined by two parameters. is 
the number of overlapping nodes, and Om is the number of communities to which each overlapping 
node belongs. 0„ here is set to 10% of the total number of nodes. Instead of fixing we allow it to 
vary from 1 to 8 indicating the overlapping diversity of overlapping nodes. By increasing the value of 
Om, we create harder detection tasks. Also, we generate 10 network instances for each configuration of 
these parameters. Hence, each metric value for a certain configuration of LFR represents the average 
metric values of all 10 instances. Since the experimental results are similar for all four pairs of exponents 
( 7 , P) = (2,1), (2, 2), (3,1), and (3,2), for the sake of brevity, we only present the results for ( 7 , P) = (2,1) 
here. In addition, there results are similar for different values of p,, so here we only show the results 
for ^ = 0.3,0.35, and 0.4. We choose /r = 0.3, 0.35, and 0.4 to better illustrate the results since with 
= 0.3, 0.35, and 0.4 the community structures generated by LFR are around the boundary of well- 
separated communities and well-connected communities. For each node, ^ = 0.5 means that the number 
of its edges with other nodes in its communities is equal to the number of its edges with nodes outside 
its community, which makes the community structure difficult to discover. 

Tables [Mll26l respectively show the best value of threshold r for SLPA, the best value of parameter 
k for CFinder, and the best value of threshold tr for SpeakEasy determined by the twelve community 
quality metrics with four possible combinations of the two versions of belonging coefficient and two version 
of belonging function on LFR benchmark networks with {a, P) = (1,2) and ^ = 0.3,0.35,0.4. Table [Ml 
implies that (BC,BF)=(1,2) is the best among the four possible combinations on all configurations of 
LFR benchmark networks except fi = 0.3, Om = 2 and p, = 0.4, Om = 1 when using SLPA. Table [25] 
demonstrates that (BC,BF)=(1,2) is the best on all configurations of LFR benchmark networks when 
using CFinder. Table indicates that (BC,BF)=(1,2) is the best among the four combinations on all 
configurations of LFR benchmark networks except fi = 0.35, Om = 6 and /i = 0.4, Om = 2,4 when using 
SpeakEasy. Consequently, we could conclude that the overlapping community quality metrics with the 
first version of belonging coefficient and the second version of the belonging function are the best among 
the four possible combinations on LER benchmark networks. 
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Table 24. The best value of threshold r for SLPA determined by the twelve community quality metrics with four 
possible combinations of the two versions of belonging coefficient and two version of belonging function on LFR 
benchmark networks with (a, /?) = (1, 2) and /r = 0.3, 0.35, 0.4. The best value for subcolumn of the last column 
is marked by red italic font. 


Om 

(BC,BF) 

Q OV 

NQov 

qL 

Q7s 

IE 

ID 

CNT 

BE 

EXP 

CND 

F 

D 



(1.1) 

0.3 

0.5 

0.5 

0.5 

0.05 

0.45 

0.05 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.05 

0.45 

0.2 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 

1 

(2.1) 

0.3 

0.5 

0.2 

0.5 

0.05 

0.5 

0.05 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.5 

0.5 

0.2 

0.5 

0.05 

0.5 

0.2 

0.45 

0.5 

0.2 

0.2 

0.5 

0.5 (6) 


(1.1) 

0.05 

0.5 

0.5 

0.25 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.3 

0.5 

0.3 

0.45 

0.4 

0.3 

0.3 

0.4 

0.5 (5) 

2 

(2.1) 

0.05 

0.5 

0.3 

0.25 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.4 

0.5 

0.3 

0.4 

0.15 

0.5 

0.3 

0.45 

0.35 

0.3 

0.3 

0.35 

0.3 (4) 


(1.1) 

0.05 

0.5 

0.5 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.15 

0.5 

0.5 

0.5 

0.4 

0.5 

0.5 

0.5 

0.5 {10) 

4 

(2.1) 

0.05 

0.5 

0.3 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(2.2) 

0.45 

0.5 

0.3 

0.5 

0.15 

0.5 

0.35 

0.5 

0.4 

0.4 

0.5 

0.4 

0.5 (5) 


(1.1) 

0.05 

0.5 

0.5 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.5 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 {10) 

6 

(2.1) 

0.05 

0.5 

0.3 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(2.2) 

0.4 

0.5 

0.3 

0.45 

0.1 

0.5 

0.2 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (6) 


(1.1) 

0.1 

0.5 

0.5 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.5 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 {10) 

8 

(2.1) 

0.1 

0.5 

0.2 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(2.2) 

0.5 

0.5 

0.2 

0.5 

0.1 

0.5 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.1) 

0.25 

0.5 

0.25 

0.4 

0.05 

0.5 

0.15 

0.5 

0.5 

0.25 

0.25 

0.25 

0.25 (5) 


(1.2) 

0.25 

0.5 

0.25 

0.4 

0.1 

0.45 

0.15 

0.45 

0.25 

0.25 

0.25 

0.25 

0.25 {6) 

1 

(2.1) 

0.2 

0.5 

0.15 

0.4 

0.05 

0.45 

0.05 

0.5 

0.25 

0.25 

0.25 

0.25 

0.25 (4) 


(2.2) 

0.25 

0.5 

0.15 

0.4 

0.05 

0.45 

0.15 

0.45 

0.25 

0.25 

0.25 

0.25 

0.25 (5) 


(1.1) 

0.05 

0.5 

0.45 

0.3 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(1.2) 

0.45 

0.5 

0.45 

0.45 

0.15 

0.45 

0.3 

0.45 

0.45 

0.3 

0.3 

0.45 

0.45 (7) 

2 

(2.1) 

0.05 

0.5 

0.3 

0.25 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.45 

0.5 

0.3 

0.45 

0.15 

0.45 

0.15 

0.45 

0.45 

0.3 

0.3 

0.45 

0.45 (6) 


(1.1) 

0.05 

0.5 

0.5 

0.5 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.15 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 {9) 

4 

(2.1) 

0.05 

0.5 

0.3 

0.5 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.5 

0.5 

0.3 

0.45 

0.1 

0.45 

0.5 

0.45 

0.45 

0.5 

0.5 

0.5 

0.5 (6) 


(1.1) 

0.05 

0.5 

0.5 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 {9) 

6 

(2.1) 

0.05 

0.5 

0.25 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.5 

0.5 

0.25 

0.5 

0.1 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.1) 

0.35 

0.5 

0.5 

0.45 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.05 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 {9) 

8 

(2.1) 

0.4 

0.5 

0.2 

0.45 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (6) 


(2.2) 

0.5 

0.5 

0.2 

0.5 

0.05 

0.45 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.1) 

0.25 

0.5 

0.5 

0.45 

0.05 

0.45 

0.1 

0.5 

0.5 

0.25 

0.5 

0.5 

0.5 {6) 


(1.2) 

0.5 

0.45 

0.5 

0.45 

0.1 

0.4 

0.25 

0.4 

0.25 

0.25 

0.25 

0.25 

0.25 (5) 

1 

(2.1) 

0.2 

0.5 

0.2 

0.45 

0.05 

0.45 

0.05 

0.5 

0.5 

0.25 

0.25 

0.5 

0.5 (4) 


(2.2) 

0.45 

0.45 

0.2 

0.45 

0.1 

0.4 

0.2 

0.4 

0.25 

0.2 

0.25 

0.25 

{0.2,0.25,0.45} (3) 


(1.1) 

0.05 

0.5 

0.5 

0.3 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.15 

0.5 

0.5 

0.4 

0.45 

0.5 

0.5 

0.5 

0.5 {9) 

2 

(2.1) 

0.05 

0.5 

0.3 

0.3 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.45 

0.5 

0.3 

0.5 

0.15 

0.4 

0.5 

0.35 

0.45 

0.5 

0.5 

0.45 

0.5 (5) 


(1.1) 

0.1 

0.5 

0.5 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.2 

0.5 

0.5 

0.4 

0.5 

0.5 

0.5 

0.5 

0.5 {10) 

4 

(2.1) 

0.1 

0.5 

0.3 

0.5 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(2.2) 

0.5 

0.5 

0.3 

0.5 

0.15 

0.5 

0.5 

0.4 

0.45 

0.5 

0.5 

0.5 

0.5 (8) 


(1.1) 

0.5 

0.5 

0.5 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.05 

0.5 

0.5 

0.45 

0.5 

0.5 

0.5 

0.5 

0.5 {10) 

6 

(2.1) 

0.5 

0.5 

0.5 

0.45 

0.05 

0.5 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(2.2) 

0.5 

0.5 

0.5 

0.5 

0.05 

0.45 

0.5 

0.4 

0.5 

0.5 

0.5 

0.5 

0.5 (9) 


(1.1) 

0.5 

0.5 

0.5 

0.4 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


(1.2) 

0.5 

0.5 

0.5 

0.5 

0.1 

0.45 

0.5 

0.4 

0.5 

0.5 

0.5 

0.5 

0.5 {9) 

8 

(2.1) 

0.5 

0.5 

0.25 

0.4 

0.05 

0.45 

0.05 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 (7) 


(2.2) 

0.5 

0.5 

0.25 

0.5 

0.05 

0.4 

0.5 

0.4 

0.5 

0.5 

0.5 

0.5 

0.5 (8) 


0.4 



















































































Table 25. The best value of parameter k for CFinder determined by the twelve community quality metrics with 


four possible combinations of the two versions of belonging coefficient and two version of belonging function on 
LFR benchmark networks with {a, 13) = (1,2) and /r = 0.3,0.35,0.4. The best value for subcolumn of the last 


column is marked by red italic font. 


Om 

(BC,BF) 

Qov 

NQov 


Qs: 

IS 

ID 

CNT 

BE 

EXP 

CND 

T 

D 


1 

(1.1) 

4 

4 

4 

4 

3 

3 

4 

10 

4 

4 

4 

4 

4 (9) 

(1.2) 

4 

4 

4 

4 

3 

3 

4 

10 

4 

4 

4 

4 

4 (9) 

(2.1) 

4 

5 

4 

4 

3 

3 

4 

10 

10 

4 

4 

4 

4(7) 

(2.2) 

4 

5 

4 

4 

3 

3 

4 

10 

10 

10 

4 

4 

4(6) 

2 

(1.1) 

4 

5 

4 

4 

3 

3 

3 

9 

4 

3 

3 

4 

{3.4} (5) 

(1.2) 

4 

5 

4 

4 

3 

3 

4 

9 

4 

3 

3 

4 

4 (6) 

(2.1) 

4 

5 

4 

4 

3 

3 

3 

9 

9 

3 

3 

4 

3 (5) 

(2.2) 

4 

6 

4 

4 

3 

3 

4 

9 

9 

9 

3 

4 

4(5) 

4 

(1.1) 

4 

5 

4 

4 

3 

3 

3 

12 

4 

3 

3 

3 

3 [6) 

(1.2) 

4 

5 

4 

4 

3 

3 

3 

12 

4 

3 

3 

3 

3 {6) 

(2.1) 

4 

6 

4 

4 

3 

3 

3 

12 

12 

12 

3 

9 

3 (4) 

(2.2) 

4 

6 

4 

4 

3 

3 

3 

12 

12 

12 

3 

7 

3 (4) 

6 

(1.1) 

4 

5 

4 

4 

3 

3 

4 

11 

5 

3 

3 

3 

3 (5) 

(1.2) 

4 

5 

4 

4 

3 

3 

4 

11 

4 

4 

4 

3 

4 (7) 

(2.1) 

4 

7 

4 

5 

3 

3 

4 

11 

11 

11 

3 

7 

{3,4,11} (3) 

(2.2) 

4 

7 

4 

5 

3 

3 

4 

11 

11 

11 

4 

7 

4(4) 

8 

(1.1) 

4 

5 

4 

4 

3 

3 

4 

12 

6 

3 

3 

3 

3 (5) 

(1.2) 

4 

5 

4 

4 

3 

3 

4 

12 

4 

4 

4 

3 

4 (7) 

(2.1) 

4 

6 

4 

5 

3 

3 

4 

12 

12 

12 

3 

7 

{3,4,12} (3) 

(2.2) 

4 

6 

4 

5 

3 

3 

4 

12 

12 

12 

4 

6 

4(4) 

1 

(1.1) 

4 

4 

4 

4 

3 

3 

3 

10 

4 

4 

4 

4 

4 («) 

(1.2) 

4 

4 

4 

4 

3 

3 

4 

10 

4 

3 

4 

4 

4 («) 

(2.1) 

4 

4 

4 

4 

3 

3 

3 

10 

10 

10 

4 

4 

4(6) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

10 

10 

10 

3 

4 

4(5) 

2 

(1.1) 

4 

4 

4 

4 

3 

3 

3 

8 

4 

3 

3 

3 

3 {6) 

(1.2) 

4 

4 

4 

4 

3 

3 

3 

8 

4 

3 

3 

3 

3 (6) 

(2.1) 

4 

5 

4 

4 

3 

3 

3 

8 

8 

8 

3 

8 

{3,8} (4) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

8 

8 

8 

3 

7 

4(4) 

4 

(1.1) 

4 

5 

4 

4 

3 

3 

3 

10 

4 

3 

3 

3 

3 (6) 

(1.2) 

4 

4 

4 

4 

3 

3 

4 

10 

4 

3 

3 

3 

4 (6) 

(2.1) 

4 

6 

4 

4 

3 

3 

3 

10 

10 

10 

3 

9 

3 (4) 

(2.2) 

4 

8 

4 

4 

3 

3 

4 

10 

10 

10 

3 

7 

4(4) 

6 

(1.1) 

4 

5 

4 

4 

3 

3 

4 

11 

5 

4 

3 

3 

4(5) 

(1.2) 

4 

5 

4 

4 

3 

3 

4 

11 

4 

4 

4 

3 

4 (7) 

(2.1) 

4 

6 

4 

4 

3 

3 

4 

11 

11 

11 

4 

8 

4(5) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

11 

11 

11 

4 

7 

4(5) 

8 

(1.1) 

4 

5 

4 

4 

3 

3 

4 

11 

6 

4 

4 

3 

4(6) 

(1.2) 

4 

5 

4 

4 

3 

3 

4 

11 

4 

4 

4 

3 

4 (7) 

(2.1) 

4 

6 

4 

5 

3 

3 

4 

11 

11 

11 

4 

6 

4(4) 

(2.2) 

4 

6 

4 

4 

3 

3 

4 

11 

11 

11 

4 

6 

4(5) 

1 

(1.1) 

4 

4 

4 

4 

3 

3 

3 

8 

4 

3 

3 

4 

4 (6) 

(1.2) 

4 

4 

4 

4 

3 

3 

3 

8 

4 

3 

3 

3 

3 (6) 

(2.1) 

4 

4 

4 

4 

3 

3 

3 

8 

8 

8 

3 

8 

{3,4,8} (4) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

8 

8 

8 

3 

7 

4(4) 

2 

(1.1) 

4 

4 

4 

4 

3 

3 

3 

9 

4 

3 

3 

3 

3 (6) 

(1.2) 

4 

4 

4 

4 

3 

3 

3 

9 

4 

3 

3 

3 

3 (6) 

(2.1) 

4 

5 

4 

4 

3 

3 

3 

9 

9 

9 

3 

9 

{3,9} (4) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

9 

9 

9 

3 

7 

4(4) 

4 

(1.1) 

4 

4 

4 

4 

3 

3 

3 

9 

5 

3 

3 

3 

3 (6) 

(1.2) 

4 

4 

4 

4 

3 

3 

4 

9 

4 

3 

3 

3 

4 (6) 

(2.1) 

4 

5 

4 

4 

3 

3 

3 

9 

9 

9 

3 

9 

{3,9} (4) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

9 

9 

9 

3 

7 

4(4) 

6 

(1.1) 

4 

4 

4 

4 

3 

3 

4 

10 

5 

4 

4 

3 

4(7) 

(1.2) 

4 

4 

4 

4 

3 

3 

4 

10 

4 

4 

4 

3 

4 («) 

(2.1) 

4 

5 

4 

4 

3 

3 

4 

10 

10 

10 

4 

10 

4(5) 

(2.2) 

4 

7 

4 

4 

3 

3 

4 

10 

10 

10 

4 

7 

4(5) 

8 

(1.1) 

4 

5 

4 

4 

3 

3 

4 

10 

5 

4 

4 

3 

4(6) 

(1.2) 

4 

5 

4 

4 

3 

3 

4 

10 

4 

4 

4 

3 

4 (7) 

(2.1) 

4 

5 

4 

4 

3 

3 

4 

10 

10 

10 

4 

9 

4(5) 

(2.2) 

4 

6 

4 

4 

3 

3 

4 

10 

10 

10 

4 

6 

4(5) 



















































































Table 26. The best value of threshold tr for SpeakEasy determined by the twelve community quality metrics 


with four possible combinations of the two versions of belonging coefficient and two version of belonging function 
on LFR benchmark networks with (a, /?) = (1, 2) and /r = 0.3, 0.35, 0.4. The best value for subcolumn of the last 


column is marked by red italic font. 


Om 

(BC,BF) 

Q OV 

NQov 

qL 

Q2 

IE 

ID 

CNT 

BE 

EXP 

CND 

E 

D 



(1,1) 

0.75 

0.2 

0.75 

0.75 

0.75 

0.8 

0.75 

0.8 

0.75 

0.75 

0.75 

0.75 

0.75 (9) 


(1.2) 

0.75 

0.2 

0.75 

0.75 

0.75 

0.8 

0.75 

0.8 

0.75 

0.75 

0.75 

0.75 

0.75 (9) 

1 

(2,1) 

0.75 

0.2 

0.75 

0.75 

0.75 

0.8 

0.75 

0.8 

0.75 

0.75 

0.75 

0.75 

0.75 (9) 


(2,2) 

0.75 

0.2 

0.75 

0.75 

0.75 

0.8 

0.75 

0.8 

0.75 

0.75 

0.75 

0.75 

0.75 (9) 


(1.1) 

0.05 

0.85 

0.8 

0.6 

0.8 

0.4 

0.8 

0.9 

0.8 

0.8 

0.8 

0.8 

0.8 (7) 


(1,2) 

0.8 

0.85 

0.8 

0.6 

0.8 

0.25 

0.8 

0.4 

0.8 

0.8 

0.8 

0.8 

0.8 {8) 

2 

(2,1) 

0.05 

0.85 

0.8 

0.6 

0.8 

0.4 

0.8 

0.9 

1 

0.8 

0.8 

0.8 

0.8 (6) 


(2,2) 

0.8 

0.85 

0.8 

0.6 

0.8 

0.25 

0.8 

0.4 

1 

0.8 

0.8 

0.8 

0.8 (7) 


(1,1) 

0.05 

1 

0.7 

0.95 

0.05 

1 

0.05 

1 

1 

0.8 

0.8 

0.95 

1 U) 


(1,2) 

0.7 

1 

0.7 

0.95 

0.2 

1 

0.6 

1 

1 

0.65 

0.65 

0.6 

1 U) 

4 

(2,1) 

0.05 

1 

0.7 

0.95 

0.05 

1 

0.05 

1 

1 

0.8 

0.8 

0.95 

1 U) 


(2,2) 

0.25 

1 

0.7 

0.45 

0.2 

1 

0.6 

1 

1 

0.45 

0.65 

0.2 

1 U) 


(1.1) 

0.05 

0.85 

0.5 

1 

0.05 

0.65 

0.05 

0.7 

0.95 

0.95 

0.95 

0.95 

0.95 (4) 


(1.2) 

0.35 

0.85 

0.5 

0.5 

0.35 

0.65 

0.35 

0.6 

0.95 

0.95 

0.95 

0.95 

0.95 (4) 

6 

(2,1) 

0.05 

0.85 

0.35 

1 

0.05 

0.65 

0.05 

0.7 

0.7 

0.95 

0.95 

1 

0.05 (3) 


(2.2) 

0.35 

0.85 

0.35 

0.5 

0.35 

0.65 

0.35 

0.7 

0.7 

0.7 

0.95 

0.7 

{0.55,0.7} (4) 


(1.1) 

0.05 

1 

0.4 

0.85 

0.05 

0.95 

0.05 

0.9 

0.9 

0.6 

0.75 

1 

0.05 (3) 


(1.2) 

0.4 

1 

0.4 

0.85 

0.4 

0.95 

0.4 

0.35 

0.6 

0.6 

0.6 

0.6 

10.4,0.6} (4) 

8 

(2,1) 

0.05 

1 

0.4 

0.85 

0.05 

0.95 

0.05 

0.9 

0.85 

0.6 

0.75 

0.85 

{0.05,0.85} (3) 


(2,2) 

0.35 

1 

0.4 

0.85 

0.4 

0.95 

0.4 

0.35 

0.85 

0.7 

0.6 

0.85 

{0.4,0.85} (3) 


(1,1) 

0.8,0.9 

0.75 

0.8 

0.8,0.9 

0.8,0.9 

0.45 

0.8,0.9 

0.45 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8 (9) 


(1,2) 

0.8,0.9 

0.75 

0.8 

0.8,0.9 

0.8,0.9 

0.45 

0.8,0.9 

0.45 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8 (9) 

1 

(2,1) 

0.8,0.9 

0.75 

0.8 

0.8,0.9 

0.8,0.9 

0.45 

0.8,0.9 

0.45 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8 (9) 


(2,2) 

0.8,0.9 

0.75 

0.8 

0.8,0.9 

0.8,0.9 

0.45 

0.8,0.9 

0.45 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8,0.9 

0.8 (9) 


(1,1) 

0.05 

0.85 

0.95 

1 

0.05 

0.9 

0.05 

0.9 

0.95 

0.95 

0.95 

0.95 

0.95 (5) 


(1,2) 

0.95 

0.85 

0.95 

1 

0.35 

0.9 

0.95 

0.5 

0.75 

0.95 

0.95 

0.95 

0.95 (6) 

2 

(2,1) 

0.05 

0.85 

0.35 

1 

0.05 

0.9 

0.05 

0.9 

0.95 

0.95 

0.95 

0.95 

0.95 (4) 


(2,2) 

0.35 

0.85 

0.35 

1 

0.35 

0.9 

0.95 

0.5 

0.75 

0.75 

0.95 

0.95 

{0.35,0.95} (3) 


(1,1) 

0.05 

0.95 

0.9 

1 

0.05 

0.95 

0.05 

0.95 

0.9 

0.75 

0.75 

1 

{0.05,0.95} (3) 


(1.2) 

0.9 

0.95 

0.9 

1 

0.65 

0.95 

0.75 

0.95 

0.9 

0.75 

0.75 

0.75 

0.75 {4} 

4 

(2,1) 

0.05 

0.95 

0.65 

1 

0.05 

0.95 

0.05 

0.95 

0.9 

0.75 

0.75 

0.9 

{0.05,0.95} (3) 


(2,2) 

0.9 

0.95 

0.65 

1 

0.65 

0.95 

0.75 

0.95 

0.6 

0.6 

0.75 

0.9 

0.95 (3) 


(1.1) 

0.05 

1 

0.8 

0.85 

0.05 

1 

0.05 

1 

0.85 

0.85 

0.85 

0.85 

0.85 (5) 


(1.2) 

0.8 

0.95 

0.8 

0.45 

0.3 

1 

0.3 

0.1 

0.85 

0.85 

0.85 

0.85 

0.85 (4) 

6 

(2.1) 

0.05 

1 

0.45 

0.85 

0.05 

1 

0.05 

1 

0.9 

0.85 

0.85 

0.85 

0.85 (4) 


(2,2) 

0.45 

0.95 

0.45 

0.45 

0.3 

1 

0.3 

0.35 

0.9 

0.9 

0.85 

0.85 

0.45 (3) 


(1,1) 

0.05 

1 

0.9 

0.8 

0.05 

0.85 

0.05 

1 

0.95 

0.85 

0.85 

0.9 

{0.05,0.85} (3) 


(1.2) 

0.9 

1 

0.9 

0.8 

0.9 

0.85 

0.85 

0.3 

0.9 

0.85 

0.85 

0.9 

0.9 (5) 

8 

(2.1) 

0.05 

1 

0.9 

0.8 

0.05 

0.85 

0.05 

1 

1 

0.85 

0.85 

0.95 

{0.05,0.85,1} (3) 


(2.2) 

0.9 

1 

0.9 

0.8 

0.9 

0.85 

0.85 

0.3 

0.2 

1 

0.85 

0.95 

{0.85,0.9} (3) 


(1,1) 

0.15 

0.75 

0.15 

0.15 

0.15 

0.75 

0.15 

0.75 

0.15 

0.15 

0.15 

0.15 

0.15 (9) 


(1,2) 

0.15 

0.2 

0.15 

0.15 

0.15 

0.75 

0.15 

0.75 

0.15 

0.15 

0.15 

0.15 

0.15 (9) 

1 

(2,1) 

0.15 

0.75 

0.15 

0.15 

0.15 

0.75 

0.15 

0.75 

0.15 

0.15 

0.15 

0.15 

0.15 (9) 


(2,2) 

0.15 

0.2 

0.15 

0.15 

0.15 

0.75 

0.15 

0.75 

0.15 

0.15 

0.15 

0.15 

0.15 (9) 


(1.1) 

0.05 

0.85 

0.95 

0.8 

0.1 

0.85 

0.05 

0.85 

0.95 

0.8 

0.8 

0.95 

{0.8,0.85,0.95} (3) 


(1,2) 

0.95 

0.85 

0.95 

0.75 

0.8 

0.85 

0.75 

0.85 

0.95 

0.8 

0.8 

0.95 

0.95 (4) 

2 

(2,1) 

0.05 

0.85 

0.95 

0.75 

0.05 

0.85 

0.05 

0.85 

0.95 

0.8 

0.8 

0.95 

{0.05,0.85,0.95} (3) 


(2,2) 

0.95 

0.85 

0.95 

0.75 

0.8 

0.85 

0.75 

0.6 

0.95 

0.95 

0.8 

0.95 

0.95 (5) 


(1,1) 

0.05 

1 

0.95 

0.9 

0.15 

1 

0.05 

0.9 

0.95 

0.95 

0.95 

0.95 

0.95 (5) 


(1,2) 

0.95 

1 

0.95 

0.9 

0.5 

1 

0.95 

0.9 

0.75 

0.95 

0.95 

0.95 

0.95 (6) 

4 

(2,1) 

0.05 

1 

0.95 

0.9 

0.05 

1 

0.05 

0.9 

0.9 

0.95 

0.95 

0.95 

0.95 (4) 


(2,2) 

0.95 

0.9 

0.95 

0.9 

0.5 

1 

0.95 

0.9 

0.7 

0.7 

0.95 

0.7 

0.95 (4) 


(1.1) 

0.1 

0.95 

0.75 

0.75 

0.15 

0.95 

0.1 

0.95 

0.95 

0.9 

0.9 

0.8 

0.95 (4) 


(1.2) 

0.75 

0.95 

0.75 

0.75 

0.15 

0.95 

0.65 

0.45 

0.8 

0.65 

0.65 

0.8 

{0.65,0.75} (3) 

6 

(2,1) 

0.05 

0.95 

0.75 

0.75 

0.05 

0.95 

0.1 

0.95 

0.75 

0.9 

0.9 

0.75 

0.75 (4) 


(2,2) 

0.75 

0.95 

0.75 

0.75 

0.15 

0.95 

0.65 

0.45 

0.75 

0.75 

0.65 

0.75 

0.75 (6) 


(1.1) 

0.05 

1 

0.9 

0.8 

0.05 

1 

0.05 

1 

0.9 

0.95 

0.95 

0.9 

{0.05,0.9,1} (3) 


(1,2) 

0.9 

1 

0.9 

0.8 

0.4 

1 

0.4 

0.45 

0.9 

0.4 

0.4 

0.4 

0.4 (5) 

8 

(2,1) 

0.05 

1 

0.9 

0.8 

0.05 

1 

0.05 

1 

0.8 

0.95 

0.95 

0.95 

{0.05,0.95,1} (3) 


(2,2) 

0.9 

1 

0.9 

0.8 

0.4 

1 

0.4 

0.45 

0.5 

0.5 

0.4 

0.5 

{0.4,0.5} (3) 
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